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Description 

FIELD OF THE INVENTION 

5 [0001] This invention relatestothefieldof plantbiology. More particularly, the present invention pertainsto compositions 
and methods for phenotypically modifying a plant. 

BACKGROUND OF THE INVENTION 

10 [0002] Transcription factors can modulate gene expression, either increasing or decreasing (inducing or repressing) 
the rate of transcription, This modulation results in differential levels of gene expression at various developmental stages, 
in different tissues and cell types, and in response to different exogenous (e.g., environmental) and endogenous stimuli 
throughout the life cycle of the organism. 

[0003] Kasuga et al., 1999, Nature Biotechnology, Vol. 17, pp. 287-291, describes gene transfer of a single stress- 
's inducible transcription factor. 

[0004] Because transcription factors are key controlling elements of biological pathways, altering the expression levels 
of one or more transcription factors can change entire biological pathways in an organism. For example, manipulation 
of the levels of selected transcription factors may result in increased expression of economically useful proteins or 
metabolic chemicals in plants or to improve other agriculturally relevant characteristics. Conversely, blocked or reduced 
20 expression of a transcription factor may reduce biosynthesis of unwanted compounds or remove an undesirable trait. 
Therefore, manipulating transcription factor levels in a plant offers tremendous potential in agricultural biotechnology for 
modifying a plant's traits. 

[0005] Described herein are novel transcription factors useful for modifying a plant's phenotype in desirable ways, 
such as modifying a plant's environmental stress tolerance. 

25 

SUMMARY OF THE INVENTION 

[0006] In one aspect, the invention is a transgenic plant as defined in the claims. 

[0007] The plant may be a soybean, wheat, corn, potato, cotton, rice, oilseed rape, sunflower, alfalfa, sugarcane, turf, 
30 banana, blackberry, blueberry, strawberry, raspberry, cantaloupe, carrot, cauliflower, coffee, cucumber, eggplant, 
grapes, honeydew, lettuce, mango, melon, onion, papaya, peas, peppers, pineapple, spinach, squash, sweet corn, 
tobacco, tomato, watermelon, rosaceous fruits, or vegetable brassicas plant. 

[0008] Furthermore, the invention relates to a method for producing a plant having improved environmental stress 
tolerance, the method being as defined in the claims. 

35 

BRIEF DESCRIPTION OF THE DRAWINGS 
[0009] 

40 Figure 1 provides a table of exemplary polynucleotide and polypeptide sequences. The table includes from left to 

right for each sequence: the SEQ ID No., the internal code reference number (GID), whether the sequence is a 
polynucleotide or polypeptide sequence, and identification of any conserved domains for the polypeptide sequences. 
Figure 2 provides a table of exemplary sequences that are homologous to other sequences provided in the Sequence 
Listing and that are derived from Arabidopsis thaliana. The table includes from left to right: the SEQ ID No., the 

45 internal code reference number (GID), identification of the homologous sequence, whether the sequence is a poly- 

nucleotide or polypeptide sequence, and identification of any conserved domains for the polypeptide sequences. 
Figure 3 provides a table of exemplary sequences that are homologous to the sequences provided in Figures 1 and 
2 and that are derived from plants other than Arabidopsis thaliana. The table includes from left to right: the SEQ ID 
No., the internal code reference number (GID), the unique GenBank sequence ID No. (NID), the probability that the 

50 comparison was generated by chance (P-value), and the species from which the homologous gene was identified. 

DETAILED DESCRIPTION 

[0010] The present invention relates to polynucleotides and polypeptides, e.g. for modifying phenotypes of plants. 
55 [0011] In particular, the polynucleotides or polypeptides are useful for modifying traits associated with a plant's envi- 
ronmental stress tolerance when the expression levels of the polynucleotides or expression levels or activity levels of 
the polypeptides are altered. Specifically, the polynucleotides and polypeptides are useful for modifying traits associated 
with a plant's environmental stress tolerance, such as freezing, chilling, heat, drought, water saturation, salt, photocon- 
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ditions, radiation and ozone, or the like. Plants with altered expression of the polynucleotides or polypeptides of the 
invention are more tolerant to these environmental stresses compared with plants without altered expression levels. 
[0012] The polynucleotides disclosed herein encode plant transcription factors. The plant transcription factors are 
derived, e.g., from Arabidopsis thaliana and can belong, e.g., to one or more of the following transcription factor families: 

5 the AP2 (APETALA2) domain transcription factorfamily (Riechmann and Meyerowitz (1 998) J. Biol. Chem. 379:633-646); 
the MYB transcription factorfamily (Martin and Paz-Ares (1 997) Trends Genet. 1 3:67-73); the MADS domain transcription 
factorfamily (Ritchmann and Meyerowitz (1997) J. Biol. Chem 378:1 079-11 01); the WRKY protein family (Ishiguro and 
Nakamura (1994) Mol. Gen. Genet. 244:563-571); the ankyrin-repeat protein family (Zhang et al. (1992) Plant Cell 4: 
1575-1588); the miscellaneous protein (MISC) family (Kim et al. (1997) Plant J. 11:1237-1251); the zinc finger protein 

10 (Z) family (Klug and Schwabe (1 995) FASEB J. 9: 597-604); the homeobox (HB) protein family (Duboule ( 1 994) Guidebook 
to the Homeobox Genes, Oxford University Press); the CAAT-element binding proteins (Forsburg and Guarente (1 989) 
Genes Dev. 3:1166-1178); the squamosa promoter binding proteins (SPB) (Klein et al. (1996) Mol. Gen. Genet. 1996 
250:7-16), the NAM protein family; the IAA/AUX proteins (Rouse et al. (1998) Science 279:1371-1373); the HLH/MYC 
protein family (Littlewood et al. (1994) Prot. Profile 1:639-709); the DNA-binding protein (DBP) family (Tucker et al. 

15 (1994) EMBO J. 1 3:2994-3002); the bZIP family of transcription factors (Foster et al. (1994) FASEB J. 8:1 92-200); the 
BPF-I protein (Box P-binding factor) family (da Costa e Silva et al. (1993) Plant J. 4:125-135); and the golden protein 
(GLD) family (Hall et al. (1998) Plant Cell 10:925-936). 

[001 3] In addition to methods for modifying a plant phenotype by employing one or more polynucleotides and polypep- 
tides described herein, the polynucleotides and polypeptides have a variety of additional uses. These uses include their 
20 use in the recombinant production (i.e, expression) of proteins; as regulators of plant gene expression, as diagnostic 
probes for the presence of complementary or partially complementary nucleic acids (including for detection of natural 
coding nucleic acids); as substrates for further reactions, e.g., mutation reactions, PCR reactions, or the like, of as 
substrates for cloning e.g., including digestion or ligation reactions, and for identifying exogenous or endogenous mod- 
ulators of the transcription factors. 

25 

DEFINITIONS 

[0014] A "polynucleotide" is a nucleic acid sequence comprising a plurality of polymerized nucleotide residues, e.g., 
at least about 15 consecutive polymerized nucleotide residues, optionally at least about 30 consecutive nucleotides, at 

30 least about 50 consecutive nucleotides. In many instances, a polynucleotide comprises a nucleotide sequence encoding 
a polypeptide (or protein) or a domain or fragment thereof. Additionally, the polynucleotide may comprise a promoter, 
an intron, an enhancer region, a polyadenylation site, a translation initiation site, 5' or 3' untranslated regions, a reporter 
gene, a selectable marker, or the like. The polynucleotide can be single stranded or double stranded DNA or RNA. The 
polynucleotide optionally comprises modified bases or a modified backbone. The polynucleotide can be, e.g., genomic 

35 DNA or RNA, a transcript (such as an mRNA), a cDNA, a PCR product, a cloned DNA, a synthetic DNA or RNA, or the 
like. The polynucleotide can comprise a sequence in either sense or antisense orientations. 

[0015] A "recombinant polynucleotide" is a polynucleotide that is not in its native state, e.g., the polynucleotide com- 
prises a nucleotide sequence not found in nature, or the polynucleotide is in a context other than that in which it is 
naturally found, e.g., separated from nucleotide sequences with which it typically is in proximity in nature, or adjacent 
40 (or contiguous with) nucleotide sequences with which it typically is not in proximity. For example, the sequence at issue 
can be cloned into a vector, or otherwise recombined with one or more additional nucleic acid. 

[0016] An "isolated polynucleotide" is a polynucleotide whether naturally occurring or recombinant, that is present 
outside the cell in which it is typically found in nature, whether purified or not. Optionally, an isolated polynucleotide is 
subject to one or more enrichment or purification procedures, e.g., cell lysis, extraction, centrifugation, precipitation, or 
45 the like. 

[0017] A "recombinant polypeptide" is a polypeptide produced by translation of a recombinant polynucleotide. An 
"isolated polypeptide," whether a naturally occurring or a recombinant polypeptide, is more enriched in (or out of) a cell 
than the polypeptide in its natural state in a wild type cell, e.g., more than about 5% enriched, more than about 10% 
enriched, or more than about 20%, or more than about 50%, or more, enriched, i.e., alternatively denoted: 1 05%, 110%, 

50 120%, 150% or more, enriched relative to wild type standardized at 100%. Such an enrichment is not the result of a 
natural response of a wild type plant. Alternatively, or additionally, the isolated polypeptide is separated from other 
cellular components with which it is typically associated, e.g., by any of the various protein purification methods herein. 
[0018] The term "transgenic plant" refers to a plant that contains genetic material, not found in a wild type plant of the 
same species, variety or cultivar. The genetic material may include a transgene, an insertional mutagenesis event (such 

55 as by transposon orT-DNAinsertional mutagenesis), an activation taggingsequence, a mutated sequence, a homologous 
recombination event or a sequence modified bychimeraplasty. Typically, the foreign genetic material has been introduced 
into the plant by human manipulation. 

[0019] A transgenic plant may contain an expression vector or cassette. The expression cassette typically comprises 
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a polypeptide-encoding sequence operably linked (i.e., under regulatory control of) to appropriate inducible orconstitutive 
regulatory sequences that allow for the expression of polypeptide. The expression cassette can be introduced into a 
plant by transformation or by breeding after transformation of a parent plant. A plant refers to a whole plant as well as 
to a plant part, such as seed, fruit, leaf, or root, plant tissue, plant cells or any other plant material, e.g., a plant explant, 

5 as well as to progeny thereof, and to in vitro systems that mimic biochemical or cellular components or processes in a cell. 
[0020] The phrase "ectopically expression or altered expression" in reference to a polynucleotide indicates that the 
pattern of expression in, e.g., a transgenic plant or plant tissue, is different from the expression pattern in a wild type 
plant or a reference plant of the same species. For example, the polynucleotide or polypeptide is expressed in a cell or 
tissue type other than a cell or tissue type in which the sequence is expressed in the wild type plant, or by expression 

10 at a time other than at the time the sequence is expressed in the wild type plant, or by a response to different inducible 
agents, such as hormones or environmental signals, or at different expression levels (either higher or lower) compared 
with those found in a wild type plant. The term also refers to altered expression patterns that are produced by lowering 
the levels of expression to below the detection level or completely abolishing expression. The resulting expression 
pattern can be transient or stable, constitutive or inducible. In reference to a polypeptide, the term "ectopic expression 

is or altered expression" further may relate to altered activity levels resulting from the interactions of the polypeptides with 
exogenous or endogenous modulators or from interactions with factors or as a result of the chemical modification of the 
polypeptides: 

[0021] The term "fragment" or "domain," with respect to a polypeptide, refers to a subsequence of the polypeptide. In 
some cases, the fragment or domain, is a subsequence of the polypeptide which performs at least one biological function 

20 of the intact polypeptide in substantially the same manner, or to a similar extent, as does the intact polypeptide. For 
example, a polypeptide fragment can comprise a recognizable structural motif or functional domain such as a DNA 
binding domain that binds to a DNA promoter region, an activation domain or a domain for protein-protein interactions. 
Fragments can vary in size from as few as 6 amino acids to the full length of the intact polypeptide, but are preferably 
at least about 30 amino acids in length and more preferably at least about 60 amino acids in length. In reference to a 

25 nucleotide sequence, "a fragment" refers to any subsequence of a polynucleotide, typically, of at least consecutive about 
15 nucleotides, preferably at least about 30 nucleotides, more preferably at least about 50, of any of the sequences 
provided herein. 

[0022] The term "trait" refers to a physiological, morphological, biochemical or physical characteristic of a plant or 
particular plant material or cell. In some instances, this characteristic is visible to the human eye, such as seed or plant 
30 size, or can be measured by available biochemical techniques, such as the protein, starch or oil content of seed or 
leaves or by the observation of the expression level of genes, e.g., by employing Northern analysis, RT-PCR, microarray 
gene expression assays or reporter gene expression systems, or by agricultural observations such as stress tolerance, 
yield or pathogen tolerance. 

[0023] "Trait modification" refers to a detectable difference in a characteristic in a plant ectopically expressing a 
35 polynucleotide or polypeptide of the present invention relative to a plant not doing so, such as a wild type plant. In some 
cases, the trait modification can be evaluated quantitatively. For example, the trait modification can entail at least about 
a 2% increase or decrease in an observed trait (difference), at least a 5% difference, at least about a 10% difference, 
at least about a 20% difference, at least about a 30%, at least about a 50%, at least about a 70%, or at least about a 
1 00%, or an even greater difference. It is known that there can be a natural variation in the modified trait. Therefore, the 
40 trait modification observed entails a change of the normal distribution of the trait in the plants compared with the distribution 
observed in wild type plant. 

[0024] Trait modifications of particular interest include those to seed ( such as embryo or endosperm), fruit, root, 
flower, leaf, stem, shoot, seedling orthe like, including: enhanced tolerance to environmental conditions including freezing, 
chilling, heat, drought, water saturation, radiation and ozone; improved tolerance to microbial, fungal or viral diseases; 

45 improved tolerance to pest infestations, including nematodes, mollicutes, parasitic higher plants or the like; decreased 
herbicide sensitivity; improved tolerance of heavy metals or enhanced ability to take up heavy metals; improved growth 
under poor photoconditions (e.g., low light and/or short day length), or changes in expression levels of genes of interest. 
Other phenotype that can be modified relate to the production of plant metabolites, such as variations in the production 
of taxol, tocopherol, tocotrienol, sterols, phytosterols, vitamins, wax monomers, anti-oxidants, amino acids, lignins, 

50 cellulose, tannins, prenyllipids (such as chlorophylls and carotenoids), glucosinolates, and terpenoids, enhanced or 
compositionally altered protein or oil production (especially in seeds), or modified sugar (insoluble or soluble) and/or 
starch composition. Physical plant characteristics that can be modified include cell development (such as the number 
of trichomes), fruit and seed size and number, yields of plant parts such as stems, leaves and roots, the stability of the 
seeds during storage, characteristics of the seed pod (e.g., susceptibility to shattering), root hair length and quantity, 

55 internode distances, or the quality of seed coat. Plant growth characteristics that can be modified include growth rate, 
germination rate of seeds, vigor of plants and seedlings, leaf and flower senescence, male sterility, apomixis, flowering 
time, flower abscission, rate of nitrogen uptake, biomass or transpiration characteristics, as well as plant architecture 
characteristics such as apical dominance, branching patterns, number of organs, organ identity, organ shape or size. 
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POLYPEPTIDES AND POLYNUCLEOTIDES 

[0025] Described herein are among other things, transcription factors (TFs), and transcription factor homologue 
polypeptides, and isolated or recombinant polynucleotides encoding the polypeptides. These polypeptides and polynu- 

5 cleotides may be employed to modify a plant's environmental stress tolerance. 

[0026] Exemplary polynucleotides encoding the polypeptides were identified in the Arabidopsis thaliana GenBank 
database using publicly available sequence analysis programs and parameters. Sequences initially identified were then 
further characterized to identify sequences comprising specified sequence strings corresponding to sequence motifs 
present in families of known transcription factors. Polynucleotide sequences meeting such criteria were confirmed as 

10 transcription factors. 

[0027] Additional polynucleotides were identified by screening Arabidopsis thaliana and/or other plant cDNA libraries 
with probes corresponding to known transcription factors under low stringency hybridization conditions. Additional se- 
quences, including full length coding sequences were subsequently recovered by the rapid amplification of cDNA ends 
(RACE) procedure, using a commercially available kit according to the manufacturer's instructions. Where necessary, 
is multiple rounds of RACE are performed to isolate 5' and 3' ends. The full length cDNA was then recovered by a routine 
end-to-end polymerase chain reaction (PCR) using primers specific to the isolated 5' and 3' ends. Exemplary sequences 
are provided in the Sequence Listing. 

[0028] The polynucleotides were ectopically expressed in overexpressor or knockout plants and changes in the en- 
vironmental stress tolerance of the plants was observed. Therefore, the polynucleotides and polypeptides can be em- 
20 ployed to improve the environmental stress resistance of plants. 

Making polynucleotides 

[0029] The polynucleotides include sequences that encode transcription factors and transcription factor homologue 
25 polypeptides and sequences complementary thereto, as well as unique fragments of coding sequence, or sequence 
complementary thereto. Such polynucleotides can be, e.g., DNA or RNA, e.g., mRNA, cRNA, synthetic RNA, genomic 
DNA, cDNA synthetic DNA, oligonucleotides, etc. The polynucleotides are either double-stranded or single-stranded, 
and include either, or both sense (i.e., coding) sequences and antisense (i.e., non-coding, complementary) sequences. 
The polynucleotides include the coding sequence of a transcription factor, or transcription factor homologue polypeptide, 
30 in isolation, in combination with additional coding sequences (e.g., a purification tag, a localization signal, as a fusion- 
protein, as a pre-protein, or the like), in combination with non-coding sequences (e.g., introns or inteins, regulatory 
elements such as promoters, enhancers, terminators, and the like), and/or in a vector or host environment in which the 
polynucleotide encoding a transcription factor or transcription factor homologue polypeptide is an endogenous or exog- 
enous gene. 

35 [0030] A variety of methods exist for producing the polynucleotides. Procedures for identifying and isolating DNA 
clones are well known to those of skill in the art, and are described in, e.g., Berger and Kimmel, Guide to Molecular 
Cloning Techniques, Methods in Enzymology volume 152 Academic Press, Inc., San Diego, CA ("Berger"); Sambrook 
et al., Molecular Cloning - A Laboratory Manual (2nd Ed.), Vol. 1 -3, Cold Spring Harbor Laboratory, Cold Spring Harbor, 
New York, 1989 ("Sambrook") and Current Protocols in Molecular Biology, F.M. Ausubel et al., eds., Current Protocols, 

40 a joint venture between Greene Publishing Associates, Inc. and John Wiley & Sons, Inc., (supplemented through 2000) 
("Ausubel"). 

[0031] Alternatively, polynucleotides can be produced by a variety of in vitro amplification methods adapted to the 
present invention by appropriate selection of specific or degenerate primers. Examples of protocols sufficient to direct 
persons of skill through in vitro amplification methods, including the polymerase chain reaction (PCR) the ligase chain 

45 reaction (LCR), Qbeta-replicase amplification and other RNA polymerase mediated techniques (e.g., NASBA), e.g., for 
the production of the homologous nucleic acids of the invention are found in Berger, Sambrook, and Ausubel, as well 
as Mullis et al., (1987) PCR Protocols A Guide to Methods and Applications (Innis et al. eds) Academic Press Inc. San 
Diego, CA (1990) (Innis). Improved methods for cloning in vitro amplified nucleic acids are described in Wallace et al., 
U.S. Pat. No. 5,426,039. Improved methods for amplifying large nucleic acids by PCR are summarized in Cheng et al. 

50 (1 994) Nature 369: 684-685 and the references cited therein, in which PCR amplicons of up to 40kb are generated. One 
of skill will appreciate that essentially any RNA can be converted into a double stranded DNA suitable for restriction 
digestion, PCR expansion and sequencing using reverse transcriptase and a polymerase. See, e.g., Ausubel, Sambrook 
and Berger, ali supra. 

[0032] Alternatively, polynucleotides and oligonucleotides can be assembled from fragments produced by solid-phase 
55 synthesis methods. Typically, fragments of up to approximately 1 00 bases are individually synthesized and then enzy- 
matically or chemically ligated to produce a desired sequence, e.g., a polynucleotide encoding all or part of a transcription 
factor. For example, chemical synthesis using the phosphoramidite method is described, e.g., by Beaucage et al. (1981) 
Tetrahedron Letters 22: 1 859-69; and Matthes et al. (1 984) EM BO J. 3:801 -5, Accordingto such methods, oligonucleotides 
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are synthesized, purified, annealedtotheircomplementary strand, ligated and then optionally cloned into suitable vectors. 
And if so desired, the polynucleotides and polypeptides of the invention can be custom ordered from any of a number 
of commercial suppliers. 

5 HOMOLOGOUS SEQUENCES 

[0033] Sequences homologous, i.e., that share significant sequence identity or similarity, to those provided in the 
Sequence Listing, derivedfrom Arabidopsis thaliana or from otherplants of choice are also described herein. Homologous 
sequences can be derived from any plant including monocots and dicots and in particular agriculturally important plant 

10 species, including but not limited to, crops such as soybean, wheat, corn, potato, cotton, rice, oilseed rape (including 
canola), sunflower, alfalfa, sugarcane and turf; or fruits and vegetables, such as banana, blackberry, blueberry, straw- 
berry, and raspberry, cantaloupe, carrot, cauliflower, coffee, cucumber, eggplant, grapes, honeydew, lettuce, mango, 
melon, onion, papaya, peas, peppers, pineapple, spinach, squash, sweet corn, tobacco, tomato, watermelon, rosaceous 
fruits (such as apple, peach, pear, cherry and plum) and vegetable brassicas (such as broccoli, cabbage, cauliflower, 

is brussel sprouts and kohlrabi). Other crops, fruits and vegetables whose phenotype can be changed include barley, rye, 
millet, sorghum, currant, avocado, citrus fruits such as oranges, lemons, grapefruit and tangerines, artichoke, cherries, 
nuts such as the walnut and peanut, endive, leek, roots, such as arrowroot, beet, cassava, turnip, radish, yam, and 
sweet potato, and beans. The homologous sequences may also be derived from woody species, such pine, poplar and 
eucalyptus. 

20 [0034] Transcription factors that are homologous to the listed sequences will typically share at least about 30% amino 
acid sequence identity. More closely related transcription factors can share at least about 50%, about 60%, about 65%, 
about 70%, about 75% or about 80% or about 90% or about 95% or about 98% or more sequence identity with the listed 
sequences. Factors that are most closely related to the listed sequences share, e.g., at least about 85%, about 90% or 
about 95% or more % sequence identity to the listed sequences. At the nucleotide level, the sequences will typically 

25 share at least about 40% nucleotide sequence identity, preferably at least about 50%, about 60%, about 70% or about 
80% sequence identity, and more preferably about 85%, about 90%, about 95% or about 97% or more sequence identity 
to one or more of the listed sequences. The degeneracy of the genetic code enables major variations in the nucleotide 
sequence of a polynucleotide while maintaining the amino acid sequence of the encoded protein. Conserved domains 
within a transcription factor family may exhibit a higher degree of sequence homology, such as at least 65% sequence 

30 identity including conservative substitutions, and preferably at least 80% sequence identity. 

Identifying Nucleic Acids by Hybridization 

[0035] Polynucleotides homologous to the sequences illustrated in the Sequence Listing can be identified, e.g., by 
35 hybridization to each other under stringent or under highly stringent conditions. Single stranded polynucleotides hybridize 
when they associate based on a variety of well characterized physico-chemical forces, such as hydrogen bonding, 
solvent exclusion, base stacking and the like. The stringency of a hybridization reflects the degree of sequence identity 
of the nucleic acids involved, such that the higher the stringency, the more similar are the two polynucleotide strands. 
Stringency is influenced by a variety of factors, including temperature, salt concentration and composition, organic and 
40 non-organic additives, solvents, etc. present in both the hybridization and wash solutions and incubations (and number), 
as described in more detail in the references cited above. 

[0036] An example of stringent hybridization conditions for hybridization of complementary nucleic acids which have 
more than 100 complementary residues on a filter in a Southern or northern blot is about 5°C to 20°C lower than the 
thermal melting point (Tm) forthe specific sequence at a defined ionic strength and pH. The T m is the temperature (under 

45 defined ionic strength and pH) at which 50% of the target sequence hybridizes to a perfectly matched probe. Nucleic 
acid molecules that hybridize under stringent conditions will typically hybridize to a probe based on either the entire 
cDNA or selected portions, e.g., to a unique subsequence, of the cDNA under wash conditions of 0.2x SSC to 2.0 x 
SSC, 0.1% SDS at 50-65° C, for example 0.2 x SSC, 0.1% SDS at 65° C. For identification of less closely related 
homologues washes can be performed at a lower temperature, e.g., 50° C. In general, stringency is increased by raising 

50 the wash temperature and/or decreasing the concentration of SSC. 

[0037] As another example, stringent conditions can be selected such that an oligonucleotide that is perfectly com- 
plementary to the coding oligonucleotide hybridizes to the coding oligonucleotide with at least about a 5-1 Ox higher 
signal to noise ratio than the ratio for hybridization of the perfectly complementary oligonucleotide to a nucleic acid 
encoding a transcription factor known as of the filing date of the application. Conditions can be selected such that a 

55 higher signal to noise ratio is observed in the particular assay which is used, e.g., about 15x, 25x, 35x, 50x or more. 
Accordingly, the subject nucleic acid hybridizes to the unique coding oligonucleotide with at least a 2x higher signal to 
noise ratio as compared to hybridization of the coding oligonucleotide to a nucleic acid encoding known polypeptide. 
Again, higher signal to noise ratios can be selected, e.g., about 5x, 1 0x, 25x, 35x, 50x or more. The particular signal will 
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depend on the label used in the relevant assay, e.g., a fluorescent label, acolorimetric label, a radio active label, or the like. 
[0038] Alternatively, transcription factor homologue polypeptides can be obtained by screening an expression library 
using antibodies specific for one or more transcription factors. With the provision herein of the disclosed transcription 
factor, and transcription factor homologue nucleic acid sequences, the encoded polypeptide(s) can be expressed and 
purified in a heterologous expression system (e.g., E. colt) and used to raise antibodies (monoclonal or polyclonal) 
specific for the polypeptide(s) in question. Antibodies can also be raised against synthetic peptides derived from tran- 
scription factor, or transcription factor homologue, amino acid sequences. Methods of raising antibodies are well known 
in the art and are described in Harlow and Lane (1 988) Antibodies: A Laboratory Manual, Cold Spring Harbor Laboratory, 
New York. Such antibodies can then be used to screen an expression library produced from the plant from which it is 
desired to clone additional transcription factor homologues, using the methods described above. The selected cDNAs 
can be confirmed by sequencing and enzymatic activity. 

SEQUENCE VARIATIONS 

[0039] It will readily be appreciated by those of skill in the art, that any of a variety of polynucleotide sequences are 
capable of encoding the transcription factors and transcription factor homologue polypeptides described herein. Due to 
the degeneracy of the genetic code, many different polynucleotides can encode identical and/or substantially similar 
polypeptides in addition to those sequences illustrated in the Sequence Listing. 

[0040] For example, Table 1 illustrates, e.g., that the codons AGC, AGT, TCA, TCC, TCG, and TCT all encode the 
same amino acid: serine. Accordingly, at each position in the sequence where there is a codon encoding serine, any of 
the above trinucleotide sequences can be used without altering the encoded polypeptide. 



Table 1 



Amino acids 


Codon 


Alanine 


Ala 


A 


GCA 


GCC 


GCG 


GCU 






Cysteine 


Cys 


C 


TGC 


TGT 










Aspartic acid 


Asp 


D 


GAC 


GAT 










Glutamic acid 


Glu 


E 


GAA 


GAG 










Phenylalanine 


Phe 


F 


TTC 


TTT 










Glycine 


Gly 


G 


GGA 


GGC 


GGG 


GGT 






Histidine 


His 


H 


CAC 


CAT 










Isoleucine 


lie 


I 


ATA 


ATC 


ATT 








Lysine 


Lys 


K 


AAA 


AAG 










Leucine 


Leu 


L 


TTA 


TTG 


CTA 


CTC 


CTG 


CTT 


Methionine 


Met 


M 


ATG 












Asparagine 


Asn 


N 


AAC 


AAT 










Proline 


Pro 


P 


CCA 


ccc 


CCG 


CCT 






Glutamine 


Gin 


Q 


CAA 


CAG 










Arginine 


Arg 


R 


AGA 


AGG 


CGA 


CGC 


CGG 


CGT 


Serine 


Ser 


S 


AGC 


AGT 


TCA 


TCC 


TCG 


TCT 


Threonine 


Thr 


T 


ACA 


ACC 


ACG 


ACT 






Valine 


Val 


V 


GTA 


GTC 


GTG 


GTT 






Tryptophan 


Trp 


W 


TGG 












Tyrosine 


Tyr 


Y 


TAC 


TAT 











[0041] Sequence alterations that do not change the amino acid sequence encoded by the polynucleotide are termed 
"silent" variations. With the exception of the codons ATG and TGG, encoding methionine and tryptophan, respectively, 
any of the possible codons for the same amino acid can be substituted by a variety of techniques, e.g., site-directed 
mutagenesis, available in the art. 

[0042] In addition to silent variations, other conservative variations that alter one, or a few amino acids in the encoded 
polypeptide, can be made without altering the function of the polypeptide. 

[0043] For example, substitutions, deletions and insertions introduced into the sequences provided in the Sequence 
Listing are also envisioned. Such sequence modifications can be engineered into a sequence by site-directed mutagen- 
esis (Wu (ed.) Meth. Enzymol. (1993) vol. 217, Academic Press) or the other methods noted below. Amino acid substi- 
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tutions are typically of single residues; insertions usually will be on the order of about from 1 to 1 0 amino acid residues; 
and deletions will range about from 1 to 30 residues. In preferred embodiments, deletions or insertions are made in 
adjacent pairs, e.g., a deletion of two residues or insertion of two residues. Substitutions, deletions, insertions or any 
combination thereof can be combined to arrive at a sequence. The mutations that are made in the polynucleotide encoding 
5 the transcription factor should not place the sequence out of reading frame and should not create complementary regions 
that could produce secondary mRNA structure. Preferably, the polypeptide encoded by the DNA performs the desired 
function. 

[0044] Conservative substitutions arethose in which at least one residue in the amino acid sequence has been removed 
and a different residue inserted in its place. Such substitutions generally are made in accordance with the Table 2 when 
10 it is desired to maintain the activity of the protein. Table 2 shows amino acids which can be substituted for an amino 
acid in a protein and which are typically regarded as conservative substitutions. 



Table 2 



Residue 


Conservative Substitutions 


Ala 
r\\a 




Arg 


Lys 


Asn 


Gin; His 


Asp 


Glu 


Gin 


Asn 


Cys 


Ser 


Glu 


Asp 


Gly 


Pro 


His 


Asn; Gin 


lie 


Leu, Val 


Leu 


lie; Val 


Lys 


Arg; Gin 


Met 


Leu; lie 


Phe 


Met; Leu; Tyr 


Ser 


Thr; Gly 


Thr 


Ser;Val 


Trp 


Tyr 


Tyr 


Trp; Phe 


Val 


lie; Leu 



[0045] Substitutions that are less conservative than those in Table 2 can be selected by picking residues that differ 
more significantly in their effect on maintaining (a) the structure of the polypeptide backbone in the area of the substitution, 
for example, as a sheet or helical conformation, (b) the charge or hydrophobicity of the molecule at the target site, or 
(c) the bulk of the side chain. The substitutions which in general are expected to produce the greatest changes in protein 
properties will be those in which (a) a hydrophilic residue, e.g., seryl or threonyl, is substituted for (or by) a hydrophobic 
residue, e.g., leucyl, isoleucyl, phenylalanyl, valyl or alanyl; (b) a cysteine or proline is substituted for (or by) any other 
residue; (c) a residue having an electropositive side chain, e.g., lysyl, arginyl, or histidyl, is substituted for (or by) an 
electronegative residue, e.g., glutamyl or aspartyl; or (d) a residue having a bulky side chain, e.g., phenylalanine, is 
substituted for (or by) one not having a side chain, e.g., glycine. 



FURTHER MODIFYING SEQUENCES MUTATION/FORCED EVOLUTION 



[0046] In addition to generating silent or conservative substitutions as noted, above, described herein are methods of 
modifying the sequences of the Sequence Listing. In the methods, nucleic acid or protein modification methods are used 
to alter the given sequences to produce new sequences and/or to chemically or enzymatically modify given sequences 
to change the properties of the nucleic acids or proteins. 

[0047] Thus, given nucleic acid sequences are modified, e.g., according to standard mutagenesis or artificial evolution 
methods to produce modified sequences. For example, Ausubel, supra, provides additional details on mutagenesis 
methods. Artificial forced evolution methods are described, e.g., by Stemmer (1 994) Nature 370:389-391 , and Stemmer 
(1 994) Proc. Natl. Acad. Sci. USA 91 :1 0747-1 0751 . Many other mutation and evolution methods are also available and 
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expected to be within the skill of the practitioner. 

[0048] Similarly, chemical or enzymatic alteration of expressed nucleic acids and polypeptides can be performed by 
standard methods. For example, sequence can be modified by addition of lipids, sugars, peptides, organic or inorganic 
compounds, by the inclusion of modified nucleotides or amino acids, or the like. For example, protein modification 
5 techniques are illustrated in Ausubel, supra. Further details on chemical and enzymatic modifications can be found 
herein. These modification methods can be used to modify any given sequence, or to modify any sequence produced 
by the various mutation and artificial evolution modification methods noted herein. 

[0049] Accordingly, the specification provides for modification of any given nucleic acid by mutation, evolution, chemical 
or enzymatic modification, or other available methods, as well as for the products produced by practicing such methods, 

10 e.g., using the sequences herein as a starting substrate for the various modification approaches. 

[0050] For example, optimized coding sequence containing codons preferred by a particular prokaryotic or eukaryotic 
host can be used e.g., to increase the rate of translation or to produce recombinant RNA transcripts having desirable 
properties, such as a longer half-life, as compared with transcripts produced using a non-optimized sequence. Translation 
stop codons can also be modified to reflect host preference. For example, preferred stop codons for S. cerevisiae and 

15 mammals are TAA and TGA, respectively. The preferred stop codon for monocotyledonous plants is TGA, whereas 
insects and E. coli prefer to use TAA as the stop codon. 

[0051] The polynucleotide sequences can also be engineered in order to alter a coding sequence for a variety of 
reasons, including but not limited to, alterations which modify the sequence to facilitate cloning, processing and/or 
expression of the gene product. For example, alterations are optionally introduced using techniques which are well 
20 known in the art, e.g., site-directed mutagenesis, to insert new restriction sites, to alter glycosylation patterns, to change 
codon preference, to introduce splice sites, etc. 

[0052] Furthermore, a fragment or domain derived from any of the polypeptides described herein can be combined 
with domains derived from othertranscription factors or synthetic domains to modify the biological activity of a transcription 
factor. For instance, a DNA binding domain derived from a transcription factor described herein can be combined with 
25 the activation domain of another transcription factor or with a synthetic activation domain. A transcription activation 
domain assists in initiating transcription from a DNA binding site. Examples include the transcription activation region 
of VP1 6 or GAL4 (Moore et al. (1 998) Proc. Natl. Acad. Sci. USA 95: 376-381 ; and Aoyama et al. (1 995) Plant Cell 7: 
1 773-1 785), peptides derived from bacterial sequences (Ma and Ptashne (1 987) Cell 51 ; 1 1 3-1 1 9) and synthetic peptides 
(Ginigerand Ptashne, (1987) Nature 330:670-672). 

30 

EXPRESSION AND MODIFICATION OF POLYPEPTIDES 

[0053] Typically, polynucleotide sequences are incorporated into recombinant DNA (or RNA) molecules that direct 
expression of polypeptides of the invention in appropriate host cells, transgenic plants, in vitro translation systems, or 
35 the like. Due to the inherent degeneracy of the genetic code, nucleic acid sequences which encode substantially the 
same or a functionally equivalent amino acid sequence can be substituted for any listed sequence to provide for cloning 
and expressing the relevant homologue. 

Vectors, Promoters and Expression Systems 

40 

[0054] Described herein are recombinant constructs comprising one or more of the nucleic acid sequences disclosed 
herein. The constructs typically comprise a vector, such as a plasmid, a cosmid, a phage, a virus (e.g., a plant virus), a 
bacterial artificial chromosome (BAC), ayeast artificial chromosome (YAC), orthe like, into which a nucleic acid sequence 
of the invention has been inserted, in aforward or reverse orientation. In a preferred aspectthe constructfurther comprises 
45 regulatory sequences, including, for example, a promoter, operably linked to the sequence. Large numbers of suitable 
vectors and promoters are known to those of skill in the art, and are commercially available. 

[0055] General texts which describe molecular biological techniques useful herein, including the use and production 
of vectors, promoters and many other relevanttopics, include Berger, Sambrook and Ausubel, supra. Any of the identified 
sequences can be incorporated into a cassette or vector, e.g., for expression in plants. A number of expression vectors 

50 suitable for stable transformation of plant cells or for the establishment of transgenic plants have been described including 
those described in Weissbach and Weissbach, (1 989) Methods for Plant Molecular Biology, Academic Press, and Gelvin 
et al., (1990) Plant Molecular Biology Manual, Kluwer Academic Publishers. Specific examples include those derived 
from a Ti plasmid of Agrobacterium tumefaciens, as well as those disclosed by Herrera-Estrella et al. (1983) Nature 
303: 209, Bevan (1 984) Nucl Acid Res. 1 2: 871 1 -8721 , Klee (1 985) Bio/Technology 3: 637-642, for dicotyledonous plants. 

55 [0056] Alternatively, non-Ti vectors can be used to transfer the DNA into monocotyledonous plants and cells by using 
free DNA delivery techniques. Such methods can involve, for example, the use of liposomes, electroporation, micropro- 
jectile bombardment, silicon carbide whiskers, and viruses. By using these methods transgenic plants such as wheat, 
rice (Christou (1 991 ) Bio/Technology 9: 957-962) and corn (Gordon-Kamm (1 990) Plant Cell 2: 603-61 8) can be produced. 
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An immature embryo can also be a good target tissue for monocots for direct DNA delivery techniques by using the 
particle gun (Weeks et al. (1993) Plant Physiol 102: 1077-1084; Vasil (1993) Bio/Technology 10: 667-674; Wan and 
Lemeaux (1994) Plant Physiol 104: 37-48, and for Agrobacteri urn -mediated DNA transfer (Ishida et al. (1996) Nature 
Biotech 14: 745-750). 

5 [0057] Typically, plant transformation vectors include one or more cloned plant coding sequence (genomic or cDNA) 
under the transcriptional control of 5' and 3' regulatory sequences and a dominant selectable marker. Such plant trans- 
formation vectors typically also contain a promoter (e.g., a regulatory region controlling inducible or constitutive, envi- 
ronmentally-or developmentally-regulated, or cell- or tissue-specific expression), a transcription initiation start site, an 
RNA processing signal (such as intron splice sites), a transcription termination site, and/or a polyadenylation signal. 

w [0058] Examples of constitutive plant promoters which can be useful for expressing the TF sequence include: the 
cauliflower mosaic virus (CaMV) 35S promoter, which confers constitutive, high-level expression in most plant tissues 
(see, e.g., Odel et al. (1985) Nature 313:810); the nopaline synthase promoter (An et al. (1988) Plant Physiol 88:547); 
and the octopine synthase promoter (Fromm et al. (1 989) Plant Cell 1 : 977). 

[0059] A variety of plant gene promoters that regulate gene expression in response to environmental, hormonal, 

15 chemical, developmental signals, and in a tissue-active manner can be used for expression of a TF sequence in plants. 
Choice of a promoter is based largely on the phenotype of interest and is determined by such factors as tissue (e.g., 
seed, fruit, root, pollen, vascular tissue, flower, carpel, etc.), inducibility (e.g., in response to wounding, heat, cold, 
drought, light, pathogens, etc.), timing, developmental stage, and the like. Numerous known promoters have been 
characterized and can favorable be employed to promote expression of a polynucleotide of the invention in a transgenic 

20 plant or cell of interest. For example, tissue specific promoters include: seed-specific promoters (such as the napin, 
phaseolin or DC3 promoter described in US Pat. No. 5,773,697), fruit-specific promoters that are active during fruit 
ripening (such as the dru 1 promoter (US Pat. No. 5,783,393), or the 2A1 1 promoter (US Pat. No. 4,943,674) and the 
tomato polygalacturonase promoter (Bird et al. (1988) Plant Mol Biol 11:651), root-specific promoters, such as those 
disclosed in US Patent Nos. 5,618,988, 5,837,848 and 5,905,186, pollen-active promoters such as PTA29, PTA26 and 

25 PTA1 3 (US Pat. No. 5,792,929), promoters active in vascular tissue (Ringli and Keller (1 998) Plant Mol Biol 37:977-988), 
flower-specific (Kaiser et al, (1 995) Plant Mol Biol 28:231 -243), pollen (Baerson et al. (1 994) Plant Mol Biol 26: 1 947-1 959), 
carpels (Ohl et al. (1990) Plant Cell 2:837-848), pollen and ovules (Baerson et al. (1993) Plant Mol Biol 22:255-267), 
auxin-inducible promoters (such as that described in van der Kop et al. (1999) Plant Mol Biol 39:979-990 or Baumann 
et al. (1 999) Plant Cell 1 1 :323-334), cytokinin-inducible promoter (Guevara-Garcia (1 998) Plant Mol Biol 38:743-753), 

so promoters responsive to gibberellin (Shi et al. (1998) Plant Mol Biol 38:1053-1060, Willmott et al. (1998) 38:817-825) 
and the like. Additional promoters are those that elicit expression in response to heat (Ainley et al. (1 993) Plant Mol Biol 
22: 1 3-23), light (e.g., the pea rbcS-3A promoter, Kuhlemeier et al. (1 989) Plant Cell 1 :471 , and the maize rbcS promoter, 
Schaffner and Sheen (1991) Plant Cell 3: 997); wounding (e.g., wunl, Siebertz etal. (1989) Plant Cell 1: 961); pathogens 
(such as the PR-1 promoter described in Buchel et al. (1999) Plant Mol. Biol. 40:387-396, and the PDF1.2 promoter 

35 described in Manners et al. (1998) Plant Mol. Biol. 38:1071-80), and chemicals such as methyl jasmonate or salicylic 
acid (Gatz et al. (1 997) Plant Mol Biol 48: 89-1 08). In addition, the timing of the expression can be controlled by using 
promoters such as those acting at senescence (An and Amazon (1995) Science 270: 1986-1988); or late seed devel- 
opment (Odell et al. (1994) Plant Physiol 106:447-458). 

[0060] Plant expression vectors can also include RNA processing signals that can be positioned within, upstream or 
40 downstream of the coding sequence. In addition, the expression vectors can include additional regulatory sequences 
from the 3'-untranslated region of plant genes, e.g., a 3' terminator region to increase mRNA stability of the mRNA, such 
as the Pl-ll terminator region of potato or the octopine or nopaline synthase 3' terminator regions. 

Additional Expression Elements 

45 

[0061] Specific initiation signals can aid in efficient translation of coding sequences. These signals can include, e.g., 
the ATG initiation codon and adjacent sequences. In cases where a coding sequence, its initiation codon and upstream 
sequences are inserted into the appropriate expression vector, no additional translational control signals may be needed. 
However, in cases where only coding sequence (e.g., a mature protein coding sequence), or a portion thereof, is inserted, 
50 exogenous transcriptional control signals including the ATG initiation codon can be separately provided. The initiation 
codon is provided in the correct reading frame to facilitate transcription. Exogenous transcriptional elements and initiation 
codons can be of various origins, both natural and synthetic. The efficiency of expression can be enhanced by the 
inclusion of enhancers appropriate to the cell system in use. 

55 Expression Hosts 

[0062] The present invention also relates to plant cells which are transduced with vectors described herein and the 
production of polypeptides (including fragments thereof) by recombinanttechniques. Host cells are genetically engineered 
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(i.e, nucleic acids are introduced, e.g., transduced, transformed ortransfected) with the vectors disclosed herein, which 
may be, for example, a cloning vector or an expression vector comprising the relevant nucleic acids herein. The vector 
is optionally a plasmid, a viral particle, a phage, a naked nucleic acids, etc. The engineered host cells can be cultured 
in conventional nutrient media modified as appropriate for activating promoters, selecting transformants, or amplifying 
5 the relevant gene. The culture conditions, such as temperature, pH and the like, are those previously used with the host 
cell selected for expression, and will be apparent to those skilled in the art and in the references cited herein, including, 
Sambrook and Ausubel. 

[0063] The host cell can be a plant cell. Plant protoplasts are also suitable for some applications. For example, the 
DNA fragments are introduced into plant tissues, cultured plant cells or plant protoplasts by standard methods including 

10 e I ectrop oration (Fromm et al., (1 985) Proc. Natl. Acad. Sci. USA 82, 5824, infection by viral vectors such as cauliflower 
mosaic virus (CaMV) (Hohn et al., (1982) Molecular Biology of Plant Tumors, (Academic Press, New York) pp. 549-560; 
US 4,407,956), high velocity ballistic penetration by small particles with the nucleic acid either within the matrix of small 
beads or particles, or on the surface (Klein et al., (1 987) Nature 327, 70-73), use of pollen as vector (WO 85/01 856), or 
use of Agrobacterium tumefaciens or A rhizogenes carrying a T-DNA plasmid in which DNA fragments are cloned. The 

15 T-DNA plasmid is transmitted to plantcells upon infection by Agrobacterium tumefaciens, and a portion is stably integrated 
into the plant genome (Horsch et al. (1 984) Science 233:496-498; Fraley et al. (1 983) Proc. Natl. Acad. Sci. USA 80, 4803). 
[0064] The cell can include a nucleic acid which encodes a polypeptide, wherein the cells expresses a polypeptide 
described herein. The cell can also include vector sequences, or the like. Furthermore, cells and transgenic plants which 
include any polypeptide or nucleic acid above or throughout this specification, e.g., produced by transduction of a vector 

20 of the invention, are an additional feature of the invention. 

[0065] For long-term, high-yield production of recombinant proteins, stable expression can be used. Host cells trans- 
formed with a nucleotide sequence encoding a polypeptide of the invention are optionally cultured under conditions 
suitable forthe expression and recovery of the encoded protein from cell culture. The protein orf ragmentthereof produced 
by a recombinant cell may be secreted, membrane-bound, or contained intracellular^, depending on the sequence 

25 and/or the vector used. As will be understood by those of skill in the art, expression vectors containing polynucleotides 
encoding mature proteins described herein can be designed with signal sequences which direct secretion of the mature 
polypeptides through a prokaryotic or eukaryotic cell membrane. 

Modified Amino Acids 

30 

[0066] Polypeptides may contain one or more modified amino acids. The presence of modified amino acids may be 
advantageous in, for example, increasing polypeptide half-life, reducing polypeptide antigenicity or toxicity, increasing 
polypeptide storage stability, or the like. Amino acid(s) are modified, for example, co-translationally or post-translationally 
during recombinant production or modified by synthetic or chemical means. 
35 [0067] Non-limiting examples of a modified amino acid include incorporation or other use of acetylated amino acids, 
glycosylated amino acids, sulfated amino acids, prenylated (e.g., farnesylated, geranylgeranylated) amino acids, PEG 
modified (e.g., "PEGylated") amino acids, biotinylated amino acids, carboxylated amino acids, phosphorylated amino 
acids, etc. References adequate to guide one of skill inthe modification of amino acids are replete throughout the literature. 

40 IDENTIFICATION OF ADDITIONAL FACTORS 

[0068] A transcription factor can also be used to identify additional endogenous or exogenous molecules that can 
affect a phentoype ortrait of interest. On the one hand, such molecules include organic (small or large molecules) and/or 
inorganic compounds that affect expression of (i.e., regulate) a particular transcription factor. Alternatively, such mole- 

45 cules include endogenous molecules that are acted upon either at a transcriptional level by a transcription factor to 
modify a phenotype as desired. For example, the transcription factors can be employed to identify one or more downstream 
gene with which is subject to a regulatory effect of the transcription factor. In one approach, a transcription factor or 
transcription factor homologue is expressed in a host cell, e.g, a transgenic plant cell, tissue or explant, and expression 
products, either RNA or protein, of likely or random targets are monitored, e.g., by hybridization to a microarray of nucleic 

50 acid probes corresponding to genes expressed in a tissue or cell type of interest, by two-dimensional gel electrophoresis 
of protein products, or by any other method known in the art for assessing expression of gene products at the level of 
RNA or protein. Alternatively, a transcription factor can be used to identify promoter sequences (i.e., binding sites) 
involved in the regulation of a downstream target. After identifying a promoter sequence, interactions between the 
transcription factor and the promoter sequence can be modified by changing specific nucleotides in the promoter se- 

55 quence or specific amino acids in the transcription factor that interact with the promoter sequence to alter a plant trait. 
Typically, transcription factor DNA binding sites are identified by gel shift assays. After identifying the promoter regions, 
the promoter region sequences can be employed in double-stranded DNA arrays to identify molecules that affect the 
interactions of the transcription factors with their promoters (Bulyk et al. (1999) Nature Biotechnology 17:573-577). 
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[0069] The identified transcription factors are also useful to identify proteins that modify the activity of the transcription 
factor. Such modification can occur by covalent modification, such as by phosphorylation, or by protein-protein (homo 
or-heteropolymer) interactions. Any method suitable for detecting protein-protein interactions can be employed. Among 
the methods that can be employed are co-immunoprecipitation, cross-linking and co -purification through gradients or 

5 chromatographic columns, and the two-hybrid yeast system. 

[0070] The two-hybrid system detects protein interactions in vivo and is described in Chien, et al., (1991), Proc. Natl. 
Acad. Sci. USA 88, 9578-9582 and is commercially available from Clontech (Palo Alto, Calif.). In such a system, plasmids 
are constructed that encode two hybrid proteins: one consists of the DNA-binding domain of a transcription activator 
protein fused to the TF polypeptide and the other consists of the transcription activator protein's activation domain fused 

10 to an unknown protein that is encoded by a cDNAthat has been recombined into the plasmid as part of a cDNA library. 
The DNA-binding domain fusion plasmid and the cDNA library are transformed into a strain of the yeast Saccharomyces 
cerevisiae that contains a reporter gene (e.g., lacZ) whose regulatory region contains the transcription activator's binding 
site. Either hybrid protein alone cannot activate transcription of the reporter gene. Interaction of the two hybrid proteins 
reconstitutes the functional activator protein and results in expression of the reporter gene, which is detected by an 

15 assay for the reporter gene product. Then, the library plasmids responsible for reporter gene expression are isolated 
and sequenced to identify the proteins encoded by the library plasmids. After identifying proteins that interact with the 
transcription factors, assays for compounds that interfere with the TF protein-protein interactions can be preformed. 

IDENTIFICATION OF MODULATORS 

20 

[0071] In addition totheintracellularmolecules described above, extracellular moleculesthat alter activity or expression 
of a transcription factor, either directly or indirectly, can be identified. For example, the methods can entail first placing 
a candidate molecule in contact with a plant or plant cell. The molecule can be introduced by topical administration, such 
as spraying or soaking of a plant, and then the molecule's effect on the expression or activity of the TF polypeptide or 

25 the expression of the polynucleotide monitored. Changes in the expression of the TF polypeptide can be monitored by 
use of polyclonal or monoclonal antibodies, gel electrophoresis orthe like. Changes in the expression of the corresponding 
polynucleotide sequence can be detected by use of microarrays, Northerns, quantitative PCR, or any other technique 
for monitoring changes in mRNA expression. These techniques are exemplified in Ausubel et al. (eds) Current Protocols 
in Molecular Biology, John Wiley & Sons (1 998). Such changes in the expression levels can be correlated with modified 

30 plant traits and thus identified molecules can be useful for soaking or spraying on fruit, vegetable and grain crops to 
modify traits in plants. 

[0072] Essentially any available composition can be tested for modulatory activity of expression or activity of any 
nucleic acid or polypeptide herein. Thus, available libraries of compounds such as chemicals, polypeptides, nucleic 
acids and the like can be tested for modulatory activity. Often, potential modulator compounds can be dissolved in 
35 aqueous or organic (e.g., DMSO-based) solutions for easy delivery to the cell or plant of interest in which the activity of 
the modulator is to be tested. Optionally, the assays are designed to screen large modulator composition libraries by 
automating the assay steps and providing compounds from any convenient source to assays, which are typically run in 
parallel (e.g., in microtiter formats on microtiter plates in robotic assays). 

[0073] In one embodiment, high throughput screening methods involve providing a combinatorial library containing a 
40 large number of potential compounds (potential modulator compounds). Such "combinatorial chemical libraries" are then 
screened in one or more assays, as described herein, to identify those library members (particular chemical species or 
subclasses) that display a desired characteristic activity. The compounds thus identified can serve as target compounds. 
[0074] A combinatorial chemical library can be, e.g., a collection of diverse chemical compounds generated by chemical 
synthesis or biological synthesis. For example, a combinatorial chemical library such as a polypeptide library is formed 
45 by combining a set of chemical building blocks (e.g., in one example, amino acids) in every possible way for a given 
compound length (i.e., the number of amino acids in a polypeptide compound of a set length). Exemplary libraries include 
peptide libraries, nucleic acid libraries, antibody libraries (see, e.g., Vaughn et al. (1996) Nature Biotechnology, 14(3): 
309-314 and PCT/US96/10287), carbohydrate libraries (see, e.g., Liang et al. Science (1996) 274:1520-1522 and U.S. 
Patent 5,593,853), peptide nucleic acid libraries (see, e.g., U.S. Patent 5,539,083), and small organic molecule libraries 
50 (see, e.g., benzodiazepines, Baum C&EN Jan 1 8, page 33 (1 993); isoprenoids, U.S. Patent 5,569,588; thiazolidinones 
and metathiazanones, U.S. Patent 5,549,974; pyrrolidines, U.S. Patents 5,525,735 and 5,519,134; morpholino com- 
pounds, U.S. Patent 5,506,337) and the like. 

[0075] Preparation and screening of combinatorial or other libraries is well known to those of skill in the art. Such 
combinatorial chemical libraries include, but are not limited to, peptide libraries (see, e.g., U.S. Patent 5,01 0,1 75, Furka, 
55 int. J. Pept. Prot. Res. 37:487-493 (1991) and Houghton etal. Nature 354:84-88 (1991)). Other chemistries for generating 
chemical diversity libraries can also be used. 

[0076] In addition, as noted, compound screening equipment for high-throughput screening is generally available, 
e.g., using any of a number of well known robotic systems that have also been developed for solution phase chemistries 
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useful in assay systems. These systems include automated workstations including an automated synthesis apparatus 
and robotic systems utilizing robotic arms. Any of the above devices are suitable for use with the present invention, e.g., 
for high-throughput screening of potential modulators. The nature and implementation of modifications to these devices 
(if any) so that they can operate as discussed herein will be apparent to persons skilled in the relevant art. 
[0077] Indeed, entire high throughput screening systems are commercially available. These systemstypically automate 
entire procedures including all sample and reagent pipetting, liquid dispensing, timed incubations, and final readings of 
the microplate in detector(s) appropriate for the assay. These configurable systems provide high throughput and rapid 
start up as well as a high degree of flexibility and customization. Similarly, microfluidic implementations of screening are 
also commercially available. 

[0078] The manufacturers of such systems provide detailed protocols the various high throughput. Thus, for example, 
ZymarkCorp. provides technical bulletins describing screening systems for detecting the modulation of gene transcription, 
ligand binding, andthe like. The integrated systems herein, in additionto providingfor sequence alignment and, optionally, 
synthesis of relevant nucleic acids, can include such screening apparatus to identify modulators that have an effect on 
one or more polynucleotides or polypeptides according to the present invention. 

[0079] In some assays it is desirable to have positive controls to ensure that the components of the assays are working 
properly. At least two types of positive controls are appropriate. That is, known transcriptional activators or inhibitors 
can be incubated with cells/plants/ etc. in one sample of the assay, and the resulting increase/decrease in transcription 
can be detected by measuring the resulting increase in RNA/ protein expression, etc., according to the methods herein. 
It will be appreciated that modulators can also be combined with transcriptional activators or inhibitors to find modulators 
which inhibit transcriptional activation or transcriptional repression. Either expression of the nucleic acids and proteins 
herein or any additional nucleic acids or proteins activated by the nucleic acids or proteins herein, or both, can be 
monitored. 

[0080] Described herein is a method for identifying compositions that modulate the activity or expression of a polynu- 
cleotide or polypeptide of the invention. For example, a test compound, whether a small or large molecule, is placed in 
contact with a cell, plant (or plant tissue or explant), or composition comprising the polynucleotide or polypeptide of 
interest and a resulting effect on the cell, plant, (or tissue or explant) or composition is evaluated by monitoring, either 
directly or indirectly, one or more of: expression level of the polynucleotide or polypeptide, activity (or modulation of the 
activity) of the polynucleotide or polypeptide. In some cases, an alteration in aplantphenotypecan be detected following 
contact of a plant (or plant cell, or tissue or explant) with the putative modulator, e.g., by modulation of expression or 
activity of a polynucleotide or polypeptide of the invention. 

SUBSEQUENCES 

[0081] Also contemplated are uses of polynucleotides, also referred to herein as oligonucleotides, typically having at 
least 12 bases, preferably at least 15, more preferably at least 20, 30, or 50 bases, which hybridize under at least highly 
stringent (or ultra-high stringent or ultra-ultra- high stringent conditions) conditions to a polynucleotide sequence described 
above. The polynucleotides may be used as probes, primers, sense and antisense agents, and the like, according to 
methods as noted supra. 

[0082] Subsequences of the polynucleotides described herein including polynucleotide fragments and oligonucleotides 
are useful as nucleic acid probes and primers. An oligonucleotide suitable for use as a probe or primer is at least about 
1 5 nucleotides in length, more often at least about 1 8 nucleotides, often at least about 21 nucleotides, frequently at least 
about 30 nucleotides, or about 40 nucleotides, or more in length. A nucleic acid probe is useful in hybridization protocols, 
e.g., to identify additional polypeptide homologues of the invention, including protocols for microarray experiments. 
Primers can be annealed to a complementary target DNA strand by nucleic acid hybridization to form a hybrid between 
the primer and the target DNA strand, and then extended along the target DNA strand by a DNA polymerase enzyme. 
Primer pairs can be used for amplification of a nucleic acid sequence, e.g., by the polymerase chain reaction (PCR) or 
other nucleic-acid amplification methods. See Sambrook and Ausubel, supra. 

[0083] In addition, described herein are isolated or recombinant polypeptides including a subsequence of at least 
about 1 5 contiguous amino acids encoded by the recombinant or isolated polynucleotides disclosed herein. For example, 
such polypeptides, or domains or fragments thereof, can be used as immunogens, e.g., to produce antibodies specific 
for the polypeptide sequence, or as probes for detecting a sequence of interest. A subsequence can range in size from 
about 1 5 amino acids in length up to and including the full length of the polypeptide. 

PRODUCTION OF TRANSGENIC PLANTS 

Modification of Traits 

[0084] The polynucleotides disclosed herein are favorably employed to produce transgenic plants with various traits, 
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or characteristics, that have been modified in a desirable manner, e.g., to improve the environmental stress resistance 
of a plant. For example, alteration of expression levels or patterns (e.g., spatial or temporal expression patterns) of one 
or more of the transcription factors (or transcription factor homologues) described herein, as compared with the levels 
of the same protein found in a wild type plant, can be used to modify a plant's traits. An illustrative example of trait 
5 modification, improved environmental stress tolerance, by altering expression levels of a particular transcription factor 
is described further in the Examples and the Sequence Listing. 

Antisense and Cosuppression Approaches 

10 [0085] In addition to expression of the nucleic acids disclosed herein as gene replacement or plant phenotype modi- 
fication nucleic acids, the nucleic acids are also useful for sense and anti-sense suppression of expression, e.g., to 
down-regulate expression of a nucleic acid, e.g., as a further mechanism for modulating plant phenotype. That is, the 
nucleic acids of the invention, or subsequences or anti-sense sequences thereof, can be used to block expression of 
naturally occurring homologous nucleic acids. A variety of sense and anti-sense technologies are known in the art, e.g., 

is assetforth in Lichtenstein and Nellen (1 997) Antisense Technology: A Practical Approach IRL Press at Oxford University, 
Oxford, England. In general, sense or anti-sense sequences are introduced into a cell, wherethey are optionally amplified, 
e.g., by transcription. Such sequences include both simple oligonucleotide sequences and catalytic sequences such as 
ribozymes. 

[0086] For example, a reduction or elimination of expression (i.e., a "knock-out") of a transcription factor ortranscription 
20 factor homologue polypeptide in a transgenic plant, e.g., to modify a plant trait, can be obtained by introducing an 
antisense construct corresponding to the polypeptide of interest as a cDNA. For antisense suppression, the transcription 
factor or homologue cDNA is arranged in reverse orientation (with respecttothe coding sequence) relative to the promoter 
sequence in the expression vector. The introduced sequence need not be the full length cDNA or gene, and need not 
be identical to the cDNA or gene found in the plant type to be transformed. Typically, the antisense sequence need only 
25 be capable of hybridizing to the target gene or RNA of interest. Thus, where the introduced sequence is of shorter length, 
a higher degree of homology to the endogenous transcription factor sequence will be needed for effective antisense 
suppression. While antisense sequences of various lengths can be utilized, preferably, the introduced antisense seq uence 
in the vector will be at least 30 nucleotides in length, and improved antisense suppression will typically be observed as 
the length of the antisense sequence increases. Preferably, the length of the antisense sequence in the vector will be 
30 greater than 100 nucleotides. Transcription of an antisense construct as described results in the production of RNA 
molecules that are the reverse complement of mRNA molecules transcribed from the endogenous transcription factor 
gene in the plant cell. 

[0087] Suppression of endogenous transcription factor gene expression can also be achieved using a ribozyme. 
Ribozymes are RNA molecules that possess highly specific endoribonuclease activity. The production and use of ri- 
35 bozymes are disclosed in U.S. Patent No. 4,987,071 and U.S. Patent No. 5,543,508. Synthetic ribozyme sequences 
including antisense RNAs can be used to confer RNA cleaving activity on the antisense RNA, such that endogenous 
mRNA molecules that hybridize to the antisense RNA are cleaved, which in turn leads to an enhanced antisense inhibition 
of endogenous gene expression. 

[0088] Vectors in which RNA encoded by a transcription factor or transcription factor homologue cDNA is over-ex- 
40 pressed can also be used to obtain co-suppression of a corresponding endogenous gene, e.g., in the manner described 
in U.S. Patent No. 5,231 ,020 to Jorgensen. Such co-suppression (also termed sense suppression) does not require that 
the entire transcription factor cDNA be introduced into the plant cells, nor does it require that the introduced sequence 
be exactly identical to the endogenous transcription factor gene of interest. However, as with antisense suppression, 
the suppressive efficiency will be enhanced as specificity of hybridization is increased, e.g., as the introduced sequence 
45 is lengthened, and/or as the sequence similarity between the introduced sequence and the endogenous transcription 
factor gene is increased. 

[0089] Vectors expressing an untranslatable form of the transcription factor mRNA, e.g., sequences comprising one 
or more stop codon, or nonsense mutation) can also be used to suppress expression of an endogenous transcription 
factor, thereby reducing or eliminating it's activity and modifying one or more traits. Methods for producing such constructs 
50 are described in U.S. Patent No. 5,583,021 . Preferably, such constructs are made by introducing a premature stop codon 
into the transcription factor gene. Alternatively, a plant trait can be modified by gene silencing using double-strand RNA 
(Sharp (1999) Genes and Development 13: 139-141). 

[0090] Another method for abolishing the expression of a gene is by insertion mutagenesis using the T-DNA of Agro- 
bacterium tumefaciens. After generating the insertion mutants, the mutants can be screened to identify those containing 
55 the insertion in a transcription factor ortranscription factor homologue gene. Plants containing a single transgene insertion 
event at the desired gene can be crossed to generate homozygous plants for the mutation (Koncz et al. (1 992) Methods 
in Arabidopsis Research, World Scientific). 

[0091] Alternatively, a plant phenotype can be altered by eliminating an endogenous gene, such as a transcription 
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factor or transcription factor homologue, e.g., by homologous recombination (Kempin et al. (1997) Nature 389:802). 
[0092] A plant trait can also be modified by using the cre-lox system (for example, as described in US Pat. No. 
5,658,772). A plant genome can be modified to include first and second lox sites that are then contacted with a Cre 
recombinase. If the lox sites are in the same orientation, the intervening DNA sequence between the two sites is excised. 

5 If the lox sites are in the opposite orientation, the intervening sequence is inverted. 

[0093] The polynucleotides and polypeptides described herein can also be expressed in a plant in the absence of an 
expression cassette by manipulating the activity or expression level of the endogenous gene by other means. For 
example, by ectopically expressing a gene by T-DNA activation tagging (Ichikawa et al. (1997) Nature 390 698-701; 
Kakimoto et al. (1996) Science 274: 982-985). This method entails transforming a plant with a gene tag containing 

10 multiple transcriptional enhancers and once the tag has inserted into the genome, expression of a flanking gene coding 
sequence becomes deregulated. In another example, the transcriptional machinery in a plant can be modified so as to 
increase transcription levels of a polynucleotide of the invention (See, e.g., PCT Publications WO 96/06166 and WO 
98/53057 which describe the modification of the DNA binding specificity of zinc finger proteins by changing particular 
amino acids in the DNA binding motif). 

15 [0094] The transgenic plant can also include the machinery necessary for expressing or altering the activity of a 
polypeptide encoded by an endogenous gene, for example by altering the phosphorylation state of the polypeptide to 
maintain it in an activated state. 

[0095] Transgenic plants (or plant cells, or plant explants, or plant tissues) incorporating the polynucleotides and/or 
expressing the polypeptides described herein can be produced by a variety of well established techniques as described 
20 above. Following construction of a vector, most typically an expression cassette, including a polynucleotide, e.g., encoding 
a transcription factor or transcription factor homologue, standard techniques can be used to introduce the polynucleotide 
into a plant, a plant cell, a plant explant or a plant tissue of interest. Optionally, the plant cell, explant or tissue can be 
regenerated to produce a transgenic plant. 

[0096] The plant can be any higher plant, including gymnosperms, monocotyledonous and dicotyledenous plants. 

25 Suitable protocols are available for Leguminosae (alfalfa, soybean, clover, etc.), Umbefliferae (carrot, celery, parsnip), 
Cruciferae (cabbage, radish, rapeseed, broccoli, etc.), Curcurbitaceae (melon and cucumber), Gramineae (wheat, com, 
rice, barley, millet, etc.), Sofanaceae (potato, tomato, tobacco, peppers, etc.), and various other crops. See protocols 
described in Ammirato et al. (1984) Handbook of Plant Cell Culture -Crop Species. Macmillan Publ. Co. Shimamoto et 
al. (1989) Nature 338:274-276; Fromm et al. (1990) Bio/Technology 8:833-839; and Vasil et al. (1990) Bio/Technology 

30 8:429-434. 

[0097] Transformation and regeneration of both monocotyledonous and dicotyledonous plant cells is now routine, and 
the selection of the most appropriate transformation technique will be determined by the practitioner. The choice of 
method will vary with the type of plant to be transformed; those skilled in the art will recognize the suitability of particular 
methods for given plant types. Suitable methods can include, but are not limited to: electropo ration of plant protoplasts; 
35 liposome-mediated transformation; polyethylene glycol (PEG) mediated transformation; transformation using viruses; 
micro-injection of plant cells; micro-projectile bombardment of plant cells; vacuum infiltration; and Agrobacterium tume- 
ficiens mediated transformation. Transformation means introducing a nucleotide sequence in a plant in a manner to 
cause stable or transient expression of the sequence. 

[0098] Successful examples of the modification of plant characteristics by transformation with cloned sequences which 
40 serve to illustrate the current knowledge in this field of technology, include: U.S. Patent Nos. 5,571,706; 5, 677; 175; 
5,510,471; 5,750,386; 5,597,945; 5,589,615; 5,750,871; 5,268,526; 5,780,708; 5,538,880; 5,773,269; 5,736,369 and 
5,610,042. 

[0099] Following transformation, plants are preferably selected using a dominant selectable marker incorporated into 
the transformation vector. Typically, such a marker will confer antibiotic or herbicide resistance on the transformed plants, 
45 and selection of transformants can be accomplished by exposing the plants to appropriate concentrations of the antibiotic 
or herbicide. 

[0100] After transformed plants are selected and grown to maturity, those plants showing a modified trait are identified. 
The modifed trait can be any of those traits described above. Additionally, to confirm that the modified trait is due to 
changes in expression levels or activity of the polypeptide or polynucleotide described herein can be determined by 
50 analyzing mRNA expression using Northern blots, RT-PCR or microarrays, or protein expression using immunoblots or 
Western blots or gel shift assays. 

INTEGRATED SYSTEMS-SEQUENCE IDENTITY 

55 [0101] Also described herein is an integrated system, computer or computer readable medium that comprises an 
instruction set for determining the identity of one or more sequences in a database, In addition, the instruction set can 
be used to generate or identify sequences that meet any specified criteria. Furthermore, the instruction set may be used 
to associate or link certain functional benefits, such improved environmental stress tolerance, with one or more identified 



17 



EP 1 230 345 B1 



sequence. 

[0102] For example, the instruction set can include, e.g., a sequence comparison or other alignment program, e.g., 
an available program such as, for example, the Wisconsin Package Version 10.0, such as BLAST, FASTA, PILEUP, 
FINDPATTERNS or the like (GCG, Madision, Wl). Public sequence databases such as GenBank, EMBL, Swiss-Prot 

5 and PIR or private sequence databases such as PhytoSeq (Incyte Pharmaceuticals, Palo Alto, CA) can be searched. 
[0103] Alignment of sequences for comparison can be conducted by the local homology algorithm of Smith and 
Waterman (1981) Adv. Appl. Math. 2:482, by the homology alignment algorithm of Needleman and Wunsch (1970) J. 
Mol. Biol. 48:443, by the search for similarity method of Pearson and Lipman (1988) Proc. Natl. Acad. Sci. U.S.A. 85: 
2444, by computerized implementations of these algorithms. After alignment, sequence comparisons between two (or 

10 more) polynucleotides or polypeptides are typically performed by comparing sequences of the two sequences over a 
comparison window to identify and compare local regions of sequence similarity. The comparison window can be a 
segment of at least about 20 contiguous positions, usually about 50 to about 200, more usually about 1 00 to about 1 50 
contiguous positions. A description of the method is provided in Ausubel et al., supra. 

[0104] A variety of methods of determining sequence relationships can be used, including manual alignment and 
is computer assisted sequence alignment and analysis. This later approach is a preferred approach in the present invention, 
due to the increased throughput afforded by computer assisted methods. As noted above, a variety of computer programs 
for performing sequence alignment are available, or can be produced by one of skill. 

[0105] One example algorithm that is suitable for determining percent sequence identity and sequence similarity is 
the BLAST algorithm, which is described in Altschul et al. J. Mol. Biol 215:403-410 (1990). Software for performing 

20 BLAST analyses is publicly available, e.g., through the National Center for Biotechnology Information (http://www.nc- 
bi.nlm.nih.gov/). This algorithm involves first identifying high scoring sequence pairs (HSPs) by identifying short words 
of length W in the query sequence, which either match or satisfy some positive-valued threshold score T when aligned 
with a word of the same length in a database sequence. T is referred to as the neighborhood word score threshold 
(Altschul et al., supra). These initial neighborhood word hits act as seeds for initiating searches to find longer HSPs 

25 containing them. The word hits are then extended in both directions along each sequence for as far as the cumulative 
alignment score can be increased. Cumulative scores are calculated using, for nucleotide sequences, the parameters 
M (reward score for a pair of matching residues; always > 0) and N (penalty score for mismatching residues; always < 
0). For amino acid sequences, a scoring matrix is used to calculate the cumulative score. Extension of the word hits in 
each direction are halted when: the cumulative alignment score falls off by the quantity X from its maximum achieved 

30 value; the cumulative score goes to zero or below, due to the accumulation of one or more negative-scoring residue 
alignments; or the end of either sequence is reached. The BLAST algorithm parameters W, T, and X determine the 
sensitivity and speed of the alignment. The BLASTN program (for nucleotide sequences) uses as defaults a wordlength 
(W) of 11, an expectation (E) of 10, a cutoff of 100, M=5, N=-4, and a comparison of both strands. For amino acid 
sequences, the BLASTP program uses as defaults a wordlength (W) of 3, an expectation (E) of 1 0, and the BLOSUM62 

35 scoring matrix (see Henikoff & Henikoff (1989) Proc. Natl. Acad. Sci. USA 89:10915). 

[0106] In addition to calculating percent sequence identity, the BLAST algorithm also performs a statistical analysis 
of the similarity between two sequences [see, e.g., Karlin & Altschul (1993) Proc. Natl. Acad. Sci. USA 90:5873-5787). 
One measure of similarity provided by the BLAST algorithm is the smallest sum probability (P(N)), which provides an 
indication of the probability by which a match between two nucleotide or amino acid sequences would occur by chance. 

40 For example, a nucleic acid is considered similar to a reference sequence (and, therefore, in this context, homologous) 
if the smallest sum probability in a comparison of the test nucleic acid to the reference nucleic acid is less than about 
0.1 , or less than about 0.01 , and or even less than about 0.001 . An additional example of a useful sequence alignment 
algorithm is PILEUP. PILEUP creates a multiple sequence alignment from a group of related sequences using progres- 
sive, pairwise alignments. The program can align, e.g., up to 300 sequences of a maximum length of 5,000 letters. 

45 [0107] The integrated system, or computer typically includes a user input interface allowing a user to selectively view 
one or more sequence records corresponding to the one or more character strings, as well as an instruction set which 
aligns the one or more character strings with each other or with an additional character string to identify one or more 
region of sequence similarity. The system may include a link of one or more character strings with a particular phenotype 
or gene function. Typically, the system includes a user readable output element which displays an alignment produced 

50 by the alignment instruction set. 

[0108] The methods can be implemented in a localized or distributed computing environment. In a distributed envi- 
ronment, the methods may implemented on a single computer comprising multiple processors or on a multiplicity of 
computers. The computers can be linked, e.g. through a common bus, but more preferably the computer(s) are nodes 
on a network. The network can be a generalized or a dedicated local or wide-area network and, in certain preferred 

55 embodiments, the computers may be components of an intra-net or an internet. 

[0109] Thus, described herein are methods for identifying a sequence similar or homologous to one or more polynu- 
cleotides as noted herein, or one or more target polypeptides encoded by the polynucleotides, or otherwise noted herein 
and may include linking or associating a given plant phenotype or gene function with a sequence. In the methods, a 
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sequence database is provided (locally or across an inter or intra net) and a query is made againstthe sequence database 
using the relevant sequences herein and associated plant phenotypes or gene functions. 

[0110] Any sequence herein can be entered into the database, before or after querying the database. This provides 
for both expansion of the database and, if done before the querying step, for insertion of control sequences into the 
5 database. The control sequences can be detected by the query to ensure the general integrity of both the database and 
the query. As noted, the query can be performed using a web browser based interface. For example, the database can 
be a centralized public database such as those noted herein, and the querying can be done from a remote terminal or 
computer across an internet or intranet. 

10 EXAMPLES 

[0111] The following examples are intended to illustrate but not limit the present invention. 
EXAMPLE I. FULL LENGTH GENE IDENTIFICATION AND CLONING 

15 

[0112] Putative transcription factor sequences (genomic or ESTs) related to known transcription factors were identified 
in the Arabidopsis thaliana Gen Bank database using the tblastn sequence analysis program using default parameters 
and a P-value cutoff threshold of -4 or -5 or lower, depending on the length of the query sequence. Putative transcription 
factor sequence hits were then screened to identify those containing particular sequence strings. If the sequence hits 

20 contained such sequence strings, the sequences were confirmed as transcription factors. 

[0113] Alternatively, Arabidopsis thaliana cDNA libraries derived from different tissues or treatments, or genomic 
libraries were screened to identify novel members of a transcription family using a low stringency hybridization approach. 
Probes were synthesized using gene specific primers in a standard PCR reaction (annealing temperature 60° C) and 
labeled with 32 p dCTP using the High Prime DNA Labeling Kit (Boehringer Mannheim). Purified radiolabeled probes 

25 were added to filters immersed in Church hybridization medium (0.5 M NaP0 4 pH 7.0, 7% SDS, 1 % w/v bovine serum 
albumin) and hybridized overnight at 60°C with shaking. Filters were washed two times for 45 to 60 minutes with 1 xSCC, 
1%SDS at 60° C. 

[01 1 4] To identify additional sequence 5' or 3' of a partial cDN A sequence in a cDN A library, 5' and 3' rapid amplification 
of cDNA ends (RACE) was performed using the Marathon™ cDNA amplification kit (Clontech, Palo Alto, CA). Generally, 
30 the method entailed first isolating poly(A) mRNA, performing first and second strand cDNA synthesis to generate double 
stranded cDNA, blunting cDNA ends, followed by ligation of the Marathon™ Adaptor to the cDNA to form a library of 
adaptor-ligated ds cDNA. 

[0115] Gene-specific primers were designed to be used along with adaptor specific primers for both 5' and 3' RACE 
reactions. Nested primers, rather than single primers, were used to increase PCR specificity. Using 5' and 3' RACE 
35 reactions, 5' and 3' RACE fragments were obtained, sequenced and cloned. The process can be repeated until 5' and 
3' ends of the full-length gene were identified. Then the full-length cDNA was generated by PCR using primers specific 
to 5' and 3' ends of the gene by end-to-end PCR. 

EXAMPLE II. CONSTRUCTION OF EXPRESSION VECTORS 

40 

[0116] The sequence was amplified from a genomic or cDNA library using primers specific to sequences upstream 
and downstream of the coding region. The expression vector was pMEN20 or pMEN65, which are both derived from 
pMON316 (Sanders et al, (1987) Nucleic Acids Research 15:1543-58) and contain the CaMV 35S promoter to express 
transgenes. To clone the sequence into the vector, both pMEN20 and the amplified DNA fragment were digested 

45 separately with Sail and Notl restriction enzymes at 37° C for 2 hours. The digestion products were subject to electro- 
phoresis in a 0.8% agarose gel and visualized by ethidium bromide staining. The DNAfragments containing the sequence 
and the linearized plasmid were excised and purified by using a Qiaquick gel extraction kit (Qiagen, CA). The fragments 
of interest were ligated at a ratio of 3:1 (vector to insert). Ligation reactions using T4 DNA ligase (New England Biolabs, 
MA) were carried out at 1 6° C for 1 6 hours. The ligated DNAs were transformed into competent cells of the E. coli strain 

50 DH5alpha by using the heat shock method. The transformations were plated on LB plates containing 50 mg/l kanamycin 
(Sigma). 

[0117] Individual colonies were grown overnight in five milliliters of LB broth containing 50 mg/l kanamycin at 37° C. 
Plasmid DNA was purified by using Qiaquick Mini Prep kits (Qiagen, CA). 

55 EXAMPLE III. TRANSFORMATION OF A GROBACTERIUM WITH THE EXPRESSION VECTOR 

[0118] After the plasmid vector containing the gene was constructed, the vector was used to transform Agrobacterium 
tumefaciens cells expressing the gene products. The stock of Agrobacterium tumefaciens cells for transformation were 
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made as described by Nagel et al. (1990) FEMS Microbiol Letts. 67: 325-328. Agrobacterium strain ABI was grown in 
250 ml LB medium (Sigma) overnight at 28°C with shaking until an absorbance (A 600 ) of 0.5 - 1.0 was reached. Cells 
were harvested by centrifugation at 4,000 x g for 15 min at 4°C. Cells were then resuspended in 250 chilled buffer (1 
nriM HEPES, pH adjusted to 7.0 with KOH). Cells were centrifuged again as described above and resuspended in 125 
5 (xl chilled buffer. Cells were then centrifuged and resuspended two more times in the same HEPES buffer as described 
above at a volume of 100 |xl and 750 jutl, respectively. Resuspended cells were then distributed into 40 juul aliquots, quickly 
frozen in liquid nitrogen, and stored at -80° C. 

[0119] Agrobacterium cells were transformed with plasmids prepared as described above following the protocol de- 
scribed by Nagel et al. For each DNA construct to be transformed, 50 - 1 00 ng DNA (generally resuspended in 1 0 mM 

10 Tris-HCI, 1 mM EDTA, pH 8.0) was mixed with 40 pj of Agrobacterium cells. The DNA/cell mixture was then transferred 
to a chilled cuvette with a 2mm electrode gap and subject to a 2.5 kV charge dissipated at 25 fjiF and 200 ilF using a 
Gene Pulser II apparatus (Bio-Rad). After electroporation, cells were immediately resuspended in 1 .0 ml LB and allowed 
to recover without antibiotic selection for 2 - 4 hours at 28°C in a shaking incubator. After recovery, cells were plated 
onto selective medium of LB broth containing 100 fxg/ml spectinomycin (Sigma) and incubated for 24-48 hours at 28°C. 

15 Single colonies were then picked and inoculated in fresh medium. The presence of the plasmid construct was verified 
by PCR amplification and sequence analysis. 

EXAMPLE IV. TRANSFORMATION OF ARABIDOPSIS PLANTS WITH AGROBACTERIUM TU 'ME FAC I ENS WITH 
EXPRESSION VECTOR 

20 

[0120] After transformation of Agrobacterium tumefaciens with plasmid vectors containing the gene, single Agrobac- 
terium colonies were identified, propagated, and used to transform Arabidopsis plants. Briefly, 500 ml cultures of LB 
medium containing 50 mg/l kanamycin were inoculated with the colonies and grown at 28° C with shaking for 2 days 
until an absorbance (A 600 ) of > 2.0 is reached. Cells were then harvested by centrifugation at 4,000 x g for 1 0 min, and 
25 resuspended in infiltration medium (1/2 X Murashige and Skoog salts (Sigma), 1 X Gamborg's B-5 vitamins (Sigma), 
5.0% (w/v) sucrose (Sigma), 0.044 |ulM benzylamino purine (Sigma), 200 fxl/L Silwet L-77 (Lehle Seeds) until an absorb- 
ance (A 600 ) of 0.8 was reached. 

[0121] Prior to transformation, Arabidopsis thaliana seeds (ecotype Columbia) were sown at a density of ~ 1 0 plants 
per 4" pot onto Pro-Mix BX potting medium (Hummert International) covered with fiberglass mesh (18 mm X 16 mm). 
30 Plants were grown under continuous illumination (50-75 (xE/m 2 /sec) at 22-23° C with 65-70% relative humidity. After 
about 4 weeks, primary inflorescence stems (bolts) are cut off to encourage growth of multiple secondary bolts. After 
flowering of the mature secondary bolts, plants were prepared for transformation by removal of all siliques and opened 
flowers. 

[0122] The pots were then immersed upside down in the mixture of Agrobacterium infiltration medium as described 
35 above for 30 sec, and placed on their sides to allow draining into a 1 ' x 2' flat surface covered with plastic wrap. After 
24 h, the plastic wrap was removed and pots are turned upright. The immersion procedure was repeated one week later, 
for a total of two immersions per pot. Seeds were then collected from each transformation pot and analyzed following 
the protocol described below. 

40 EXAMPLE V. IDENTIFICATION OF ARABIDOPSIS PRIMARY TRANSFORM ANTS 

[0123] Seeds collected from the transformation pots were sterilized essentially as follows. Seeds were dispersed into 
in a solution containing 0.1% (v/v) Triton X-100 (Sigma) and sterile H 2 0 and washed by shaking the suspension for 20 
min. The wash solution was then drained and replaced with fresh wash solution to wash the seeds for 20 min with 

45 shaking. After removal of the second wash solution, a solution containing 0.1% (v/v) Triton X-100 and 70% ethanol 
(Equistar) was added to the seeds and the suspension was shaken for 5 min. After removal of the ethanol/detergent 
solution, a solution containing 0.1% (v/v) Triton X-100 and 30% (v/v) bleach (Clorox) was added to the seeds, and the 
suspension was shaken for 10 min. After removal of the bleach/detergent solution, seeds were then washed five times 
in sterile distilled H 2 0. The seeds were stored in the last wash water at 4° C for 2 days in the dark before being plated 

50 onto antibiotic selection medium (1 X Murashige and Skoog salts (pH adjusted to 5.7 with 1M KOH), I X Gamborg's B- 
5 vitamins, 0.9% phytagar (Life Technologies), and 50 mg/l kanamycin). Seeds were germinated under continuous 
illumination (50-75 (xE/m 2 /sec) at 22-23° C. After 7-10 days of growth under these conditions, kanamycin resistant 
primary transformants (T 1 generation) were visible and obtained. These seedlings were transferred first to fresh selection 
plates where the seedlings continued to grow for 3-5 more days, and then to soil (Pro-Mix BX potting medium). 

55 [0124] Primary transformants were crossed and progeny seeds (T 2 ) collected; kanamycin resistant seedlings were 
selected and analyzed. The expression levels of the recombinant polynucleotides in the transformants varies from about 
a 5% expression level increase to a least a 1 00% expression level increase. Similar observations are made with respect 
to polypeptide level expression. 
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EXAMPLE VI. IDENTIFICATION OF ARABIDOPSIS PLANTS WITH TRANSCRIPTION FACTOR GENE KNOCKOUTS 



[0125] The screening of insertion mutagenized Arabidopsis collections for null mutants in a known target gene was 
essentially as described in Krysan et al (1999) Plant Cell 1 1 :2283-2290. Briefly, gene-specific primers, nested by 5-250 

5 base pairs to each other, were designed from the 5' and 3' regions of a known target gene. Similarly, nested sets of 
primers were also created specific to each of the T-DNA ortransposon ends (the "right" and "left" borders). All possible 
combinations of gene specific and T-DNA/transposon primers were used to detect by PCR an insertion event within or 
close to the target gene. The amplified DNA fragments were then sequenced which allows the precise determination of 
the T-DNA/transposon insertion point relative to the target gene. Insertion events within the coding or intervening se- 

10 quence of the genes were deconvoluted from a pool comprising a plurality of insertion events to a single unique mutant 
plant for functional characterization. The method is described in more detail in Yu and Adam, US Application Serial No. 
09/1 77,733 filed October 23, 1 998. 

EXAMPLE VII. IDENTIFICATION OF ENVIRONMENTAL STRESS TOLERANCE PHENOTYPE IN OVEREX- 
15 PRESSOR OR GENE KNOCKOUT PLANTS 

[0126] Experiments were performed to identify those transformants or knockouts that exhibited an improved environ- 
mental stress tolerance. For such studies, the transformants were exposed to a variety of environmental stresses. Plants 
were exposed to chilling stress (6 hour exposure to 4-8°C), heat stress (6 hour exposure to 32-37°C), high salt stress 

20 (6 hour exposure to 200 mM NaCI), drought stress (1 68 hours after removing water from trays), osmotic stress (6 hour 
exposure to 3 M mannitol), or nutrient limitation (nitrogen, phosphate, and potassium) (Nitrogen: all components of MS 
medium remained constant except N was reduced to 20mg/L of NH 4 N03, or Phosphate: All components of MS medium 
except KH 2 PO 4, which was replaced by K2S04, Potassium: All components of MS medium except removal of KN03 
and KH2P04, which were replaced by NaH4P04). 

25 [0127] Table 3 shows the phenotypes observed for particular overexpressor or knockout plants and provides the SEQ 
ID No., the internal reference code (GID), whether a knockout or overexpressor plant was analyzed and the observed 
phenotype. 



Table 3 



SEQ ID No. 


GID 


Knockout (KO) or overexpressor (OX) 


Phenotype observed 


1 


G22 


OE 


Increased tolerance to high salt 


3 


G188 


KO 


Better germination under osmotic stress 


5 


G225 


OE 


Increased tolerance to nitrogen-limited medium 


7 


G226 


OE 


Increased tolerance to nitrogen-limited medium 


9 


G256 


OE 


Better germination and growth in cold 


11 


G419 


OE 


Increased tolerance to potassium-free medium 


13 


G464 


OE 


Better germination and growth in heat 


15 


G482 


OE 


Increased tolerance to high salt 


17 


G502 


KO 


Increased sensitivity to osmotic stress 


19 


G526 


OE 


Increased sensitivity to osmotic stress 


21 


G545 


OE 


Susceptible to high salt 


23 


G561 


OE 


Increased tolerance to potassium-free medium 


25 


G664 


OE 


Better germination and growth in cold 


27 


G682 


OE 


Better germination and growth in heat 


29 


G911 


OE 


Increased growth on potassium -free medium 


31 


G964 


OE 


Better germination and growth in heat 


33 


G394 


OE 


More sensitive to chilling 


35 


G489 


OE 


Increased tolerance to osmotic stress 
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[0128] For a particular overexpressor that shows a decreased tolerance to an environmental stress, it may be more 
useful to select a plant with a decreased expression of the particular transcription factor. For a particular knockout that 
shows a decreased tolerance to an environmental stress, it may be more useful to select a plant with an increased 
expression of the particular transcription factor. 

5 

EXAMPLE VIII. IDENTIFICATION OF HOMOLOGOUS SEQUENCES 

[0129] Homologous sequences from Arabidopsis and plant species other than Arabidopsis were identified using 
database sequence search tools, such as the Basic Local Alignment Search Tool (BLAST) (Altschul et al. (1 990) J. Mol. 
10 Biol. 21 5:403-41 0; and Altschul et al. (1 997) Nucl. Acid Res. 25: 3389-3402). The tblastx sequence analysis programs 
were employed using the BLOSUM-62 scoring matrix (Henikoff, S. and Henikoff, J. G. (1992) Proc. Natl. Acad Sci. USA 
89:0915-10919). 

[0130] Identified Arabidopsis homologous sequences are provided in Figure 2 and included in the Sequence Listing. 
The percent sequence identity among these sequences is as low as 47% sequence identity. Additionally, the entire 

15 NCBI GenBank database was filtered for sequences from all plants except Arabidopsis thaiiana by selecting all entries 
in the NCBI GenBank database associated with NCBI taxonomic ID 33090 (Viridiplantae; all plants) and excluding entries 
associated with taxonomic ID 3701 (Arabidopsis thaiiana). These sequences were compared to sequences representing 
genes of SEQ IDs Nos. 1-54 on 9/26/2000 using the Washington University TBLASTX algorithm (version 2.0a19MP). 
For each gene of SEQ IDs Nos. 1-54, individual comparisons were ordered by probability score (P-value), where the 

20 score reflects the probability that a particular alignment occurred by chance. For example, a score of 3.6e-40 is 3.6 x 
1 0" 40 . For up to ten species, the gene with the lowest P-value (and therefore the most likely homolog) is listed in Figure 3. 
[0131] In addition to P-values, comparisons were also scored by percentage identity. Percentage identity reflects the 
degree to which two segments of DNA or protein are identical over a particular length. The ranges of percent identity 
between the non-Arabidopsis genes shown in Figure 3 and the Arabidopsis genes in the sequence listing are: SEQ ID 

25 No. 1 : 53%-67%; SEQ ID No. 3: 38%-76%; SEQ ID No. 5: 34%-67%; SEQ ID No. 7: 50%-69%; SEQ ID No. 9: 32%- 
91%; SEQ ID No. 1 1: 48%-66%; SEQ ID No. 13: 34%-60%; SEQ ID No. 15: 58%-81%; SEQ ID No. 17: 65%-94%; SEQ 
ID No. 19: 72%-83%; SEQ ID No. 21: 52%-64%; SEQ ID No. 23: 40%-89%; SEQ ID No. 25: 86%-97%; SEQ ID No. 27: 
41%-75%; SEQ ID No. 29: 29%-72%; SEQ ID No. 31 : 49%-70%; SEQ ID No, 33: 56%-86%; SEQ ID No. 35: 61%-84%; 
SEQ ID No. 37: 40%-58%; SEQ ID No. 39: 63%-87%; SEQ ID No. 41: 51%-88%; SEQ ID No. 43: 80%-90%; SEQ ID 

30 No. 45: 79%-90%; SEQ ID No. 47: 30%-58%; SEQ ID No. 49: 52%-62%; SEQ ID No. 51 : 55%-73% and SEQ ID No. 
53: 44%-80%. 

[0132] The polynucleotides and polypeptides in the Sequence Listing and the identified homologous sequences may 
be stored in a computer system and have associated or linked with the sequences a function, such as that the polynu- 
cleotides and polypeptides are useful for modifying the environmental stress tolerance of a plant. 

35 

SEQUENCE LISTING 
[0133] 

40 <1 10> Pineda, Omaira 

Yu, Guo-Liang 

Creelman, Robert 

Riechmann, Jose Luis 

Heard, Jacqueline 
45 Ratcliffe, Oliver 

Reuber, Lynne 

Keddie, James 

<120> Environmental Stress Tolerance Genes 

50 

<130> MBI-0016 

<150> 60/166,228 
<151> 1999-11-17 

55 

<150> 60/197,899 
<151> 2000-04-17 
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<150> Plant Trait Modification III 
<151> 2000-08-22 

<160> 54 

5 

<170> Patentln version 3.0 

<210> 1 
<21 1> 913 
10 <212>DNA 

<213> Arabidopsis thaliana 



<220> 
<221> CDS 
15 <222> (81). .(761) 

<223> G22 

<400> 1 

agaaaacatc tctcactctc taaaatacac actctcatca aaaaccttct cttcggttca 60 

gaagcattca agaatccatt atg age tea tct gat tec gtt aat aac ggc gtt 113 

Met Ser Ser Ser Asp Ser Val Asn Asn Gly Val 
15 10 

25 aac tea egg atg tac ttc cgt aac ccg agt ttc age aac gtt ate tta 161 

Asn Ser Arg Met Tyr Phe Arg Asn Pro Ser Phe Ser Asn Val lie Leu 
15 20 25 

aac gat aac tgg age gac ttg ccg tta agt gtc gac gat tct caa gac 209 
Asn Asp Asn Trp Ser Asp Leu Pro Leu Ser Val Asp Asp Ser Gin Asp 
30 35 40 

atg get att tac aac act etc cgt gat gee gtt age tec ggc tgg aca 257 
Met Ala He Tyr Asn Thr Leu Arg Asp Ala Val Ser Ser Gly Trp Thr 
45 • 50 55 

ccc tec gtt cct ccc gtt ace tct ccg gcg gag gaa aat aag cct ccg 305 
35 Pro Ser Val Pro Pro Val Thr Ser Pro Ala Glu Glu Asn Lys Pro Pro 

60 65 70 75 

gcg acg aag gcg agt ggc tea cac gcg ccg agg cag aag ggg atg cag 353 
Ala Thr Lys Ala Ser Gly Ser His Ala Pro Arg Gin Lys Gly Met Gin 
80 85 90 

40 tac aga gga gtg agg agg agg ccg tgg ggg aaa ttc gcg gcg gag att 401 

Tyr Arg Gly Val Arg Arg Arg Pro Trp Gly Lys Phe Ala Ala Glu He 
95 100 105 

agg gat ccg aag aag aac gga get agg gtt tgg etc ggg act tac gag 44 9 

Arg Asp Pro Lys Lys Asn Gly Ala Arg Val Trp Leu Gly Thr Tyr Glu 
45 110 115 120 

acg ccg gag gac gcg gcg gtg gcg tac gac cga gcg gcg ttt cag etc 4 97 



50 



55 
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10 



20 



25 



40 



45 



50 



55 



Thr Pro Glu Asp Ala Ala Val Ala Tyr Asp Arg Ala Ala Phe Gin Leu 
125 130 135 

aga gga teg aaa get aag ctg aat ttt ccg cat ttg att ggt tct tgt 545 
Arg Gly Ser Lys Ala Lys Leu Asn Phe Pro His Leu lie Gly Ser Cys 
140 " 145 150 155 

aag tat gag ccg gtt agg att agg cct cgc cgt cgc teg ccg gaa ccg 593 
Lys Tyr Glu Pro Val Arg lie Arg Pro Arg Arg Arg Ser Pro Glu Pro 
160 165 170 

tea gtc tec gat cag tta acg teg gag cag aag agg gaa age cac gtg 641 
Ser Val Ser Asp Gin Leu Thr Ser Glu Gin Lys Arg Glu Ser His Val 
175 180 185 

gat gac ggc gag tct agt ttg gtt gta ccg gag ttg gat ttc acg gtg 689 
Asp Asp Gly Glu Ser Ser Leu Val Val Pro Glu Leu Asp Phe Thr Val 
190 195 200 

gat cag ttt tac ttc gat ggt agt tta tta atg gac caa tea gaa tgt 737 
Asp Gin Phe Tyr Phe Asp Gly Ser Leu Leu Met Asp Gin Ser Glu Cys 
205 " 210 215 

tct tat tct gat aat egg ata taa ttagttttaa gattaagcaa aatttgtcca 791 
Ser Tyr Ser Asp Asn Arg lie 
220 225 

acgagttttg ctgtatgaaa tatctatcga tgactcaaca ggttttgatc atgatcatat 851 

gtaatgtgat ggaaattaaa tattgaegtt tgtttttttg ttgtaaaaaa aaaaaaaaaa 911 

aa 913 

<210> 2 

<211> 226 

<212> PRT 

<213> Arabidopsis thaliana 



<400> 2 

35 
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Met Ser Ser Ser Asp Ser Val Asn Asn Gly Val Asn Ser Arg Met Tyr 
15 10 15 

Phe Arg Asn Pro Ser Phe Ser Asn Val He Leu Asn Asp Asn Trp Ser 
20 25 30 

Asp Leu Pro Leu Ser Val Asp Asp Ser Gin Asp Met Ala He Tyr Asn 
35 40 45 

Thr Leu Arg Asp Ala Val Ser Ser Gly Trp Thr Pro Ser Val Pro Pro 
50 ~ 55 60 

Val Thr Ser Pro Ala Glu Glu Asn Lys Pro Pro Ala Thr Lys Ala Ser 
65 70 75 80 

Gly Ser His Ala Pro Arg Gin Lys Gly Met Gin Tyr Arg Gly Val Arg 
85 90 95 

Arg Arg Pro Trp Gly Lys Phe Ala Ala Glu He Arg Asp Pro Lys Lys 
100 105 110 

Asn Gly Ala Arg Val Trp Leu Gly Thr Tyr Glu Thr Pro Glu Asp Ala 
115 120 ' 125 

Ala Val Ala Tyr Asp Arg Ala Ala Phe Gin Leu Arg Gly Ser Lys Ala 
130 " 135 140 



Lys Leu Asn Phe Pro His Leu He Gly Ser Cys Lys Tyr Glu Pro Val 

145 150 155 160 

Arg He Arg Pro Arg Arg Arg Ser Pro Glu Pro Ser Val Ser Asp Gin 

165 170 175 



Leu Thr Ser Glu Gin -Lys Arg Glu Ser His Val Asp Asp Gly Glu Ser 
180 185 190 

Ser Leu Val Val Pro Glu Leu Asp Phe Thr Val Asp Gin Phe Tyr Phe 
195 200 205 

Asp Gly Ser Leu Leu Met Asp Gin Ser Glu Cys Ser Tyr Ser Asp Asn 
210 215 220 



Arg He 

225 



<210>3 
<211> 1195 
<212> DNA 

<213> Arabidopsis thaliana 

<220> 
<221> CDS 
<222> (50).. (1096) 
<223>G188 



25 



EP 1 230 345 B1 

<400> 3 

ctctcaccaa cataatcaaa gaagctttcc tcacgaattc aagatcgcc atg tec tec 58 

Met Ser Ser 

5 1 

gag gat tgg gat etc ttc gec gtc gtc aga age tgc age tct tct gtt 106 
Glu Asp Trp Asp Leu Phe Ala Val Val Arg Ser Cys Ser Ser Ser Val 
5 10 15 

tec acc acc aat tct tgt get ggt cat gaa gac gac ata gga aac tgt 154 
70 Ser Thr Thr Asn Ser Cys Ala Gly His Glu Asp Asp lie Gly Asn Cys 

20 25 30 35 

aaa caa caa caa gat cct cct cct cct cct ctg ttt caa get tct tct 202 
Lys Gin Gin Gin Asp Pro Pro Pro Pro Pro Leu Phe Gin Ala Ser Ser 
40 45 50 

tct tgc aac gag tta caa gat tct tgc aaa cca ttt tta ccc gtt act 250 
Ser Cys Asn Glu Leu Gin Asp Ser Cys Lys Pro Phe Leu Pro Val Thr 
55 60 65 

act act act act act act tgg tct cct cct cct eta ctt cct cct cct 298 
Thr Thr Thr Thr Thr Thr Trp Ser Pro Pro Pro Leu Leu Pro Pro Pro 
20 7 0 7 5 80 

aaa gee tea tea cca tct ccc aat ate tta eta aaa caa gaa caa gta 346 
Lys Ala Ser Ser Pro Ser Pro Asn lie Leu Leu Lys Gin Glu Gin Val 
85 90 95 



25 



35 



40 



45 



50 



ctt etc gaa tea caa gat caa aaa cct cct ctt agt gtt agg gtt ttc 3 94 

Leu Leu Glu Ser Gin Asp Gin Lys Pro Pro Leu Ser Val Arg Val Phe 

100 105 110 115 

cca cca tec act tct tct tct gtc ttt gtt ttt aga ggt caa cgc gac 442 

Pro Pro Ser Thr Ser Ser Ser Val Phe Val Phe Arg Gly Gin Arg Asp 

120 125 130 

cag ctt ctt caa caa caa tec caa cct ccc ctt cga tct aga aaa aga 4 90 



55 



26 



10 



20 
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Gin Leu Leu Gin Gin Gin Ser Gin Pro Pro Leu Arg Ser Arg Lys Arg 
135 140 145 

aag aat cag caa aaa aga acc ata tgt cat gta acg caa gag aat ctt 53 8 

Lys Asn Gin Gin Lys Arg Thr lie Cys His Val Thr Gin Glu Asn Leu 
150 155 160 

tct tct gat ttg tgg get tgg cgt aaa tac ggt caa aaa ccc ate aaa 586 
Ser Ser Asp Leu Trp Ala Trp Arg Lys Tyr Gly Gin Lys Pro lie Lys 
165 170 175 

ggc tct cct tat cca agg aat tat tac aga tgt agt age tea aaa gga 634 
Gly Ser Pro Tyr Pro Arg Asn Tyr Tyr Arg Cys Ser Ser Ser Lys Gly 
180 185 190 195 

tgt tta gca cga aaa caa gtt gaa aga agt aat tta gat cct aat ate 682 
Cys Leu Ala Arg Lys Gin Val Glu Arg Ser Asn Leu Asp Pro Asn lie 
200 205 210 

ttc ate gtt act tac acc gga gaa cac act cat cca cgt cct act cac 730 
Phe lie Val Thr Tyr Thr Gly Glu His Thr His Pro Arg Pro Thr His 
215 220 225 

egg aac tct etc gee gga agt act cgt aac aaa tct cag ccc gtt aac 778 
Arg Asn Ser Leu Ala Gly Ser Thr Arg Asn Lys Ser Gin Pro Val Asn 
230 235 240 

ccg gtt cct aaa ccg gac aca tct cct tta teg gat aca gta aaa gaa 826 
Pro Val Pro Lys Pro Asp Thr Ser Pro Leu Ser Asp Thr Val Lys Glu 
245 250 255 

gag att cat ctt tct ccg acg aca ccg ttg aaa gga aac gat gac gtt 874 
Glu lie His Leu Ser Pro Thr Thr Pro Leu Lys Gly Asn Asp Asp Val 
260 265 270 275 

caa gaa acg aat gga gat gaa gat atg gtt ggt caa gaa gtc aac atg 922 
Gin Glu Thr Asn Gly Asp Glu Asp Met Val Gly Gin Glu Val Asn Met 
280 285 290 

gaa gag gaa gag gag gaa gaa gaa gtg gaa gaa gat gat gaa gaa gaa 970 
Glu Glu Glu Glu Glu Glu Glu Glu Val Glu Glu Asp Asp Glu Glu Glu 
295 300 305 

gaa gat gat gat gac gtg gat gat ctt ttg ata cca aat tta gcg gtg 1018 
Glu Asp Asp Asp Asp Val Asp Asp Leu Leu lie Pro Asn Leu Ala Val 
310 315 320 

aga gat cga gat gat ttg ttc ttc get gga agt ttt cca tct tgg tec 1066 
Arg Asp Arg Asp Asp Leu Phe Phe Ala Gly Ser Phe Pro Ser Trp Ser 
325 330 335 

gee gga tec gee ggt gac ggt ggt gga tga tgaaaacgaa taaaatctca 1116 
Ala Gly Ser Ala Gly Asp Gly Gly Gly 
340 ' 345 

atttacaatt tacaaaaaga aaaaagtcag tttttaatta ttatttttgt ttgttaaaac 1176 
45 ttgacattta ttgtgttat 1195 

<210>4 
<211>348 
50 <212>PRT 

<213> Arabidopsis thaliana 

<400> 4 
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Met Ser Ser Glu Asp Trp Asp Leu Phe Ala Val Val Arg Ser Cys Ser 
15 10 15 



Ser Ser Val Ser Thr Thr Asn Ser Cys Ala Gly His Glu Asp Asp lie 
20 25 30 
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Gly Asn Cys Lys Gin Gin Gin Asp Pro Pro Pro Pro Pro Leu Phe Gin 
35 40 45 

Ala Ser Ser Ser Cys Asn Glu Leu Gin Asp Ser Cys Lys Pro Phe Leu 
50 55 60 

Pro Val Thr Thr Thr Thr Thr Thr Thr Trp Ser Pro Pro Pro Leu Leu 
65 70 75 80 

Pro Pro Pro Lys Ala Ser Ser Pro Ser Pro Asn lie Leu Leu Lys Gin 
85 90 95 

Glu Gin Val Leu Leu Glu Ser Gin Asp Gin Lys Pro Pro Leu Ser Val 
100 105 110 

Arg Val Phe Pro Pro Ser Thr Ser Ser Ser Val Phe Val Phe Arg Gly 
115 120 125 

Gin Arg Asp Gin Leu Leu Gin Gin Gin Ser Gin Pro Pro Leu Arg Ser 
130 135 140 

Arg Lys Arg Lys Asn Gin Gin Lys Arg Thr lie Cys His Val Thr Gin 
145 ~ ' 150 155 160 

Glu Asn Leu Ser Ser Asp Leu Trp Ala Trp Arg Lys Tyr Gly Gin Lys 
165 170 175 

Pro lie Lys Gly Ser Pro Tyr Pro Arg Asn Tyr Tyr Arg Cys Ser Ser 
180 185 190 

Ser Lys Gly Cys Leu Ala Arg Lys Gin Val Glu Arg Ser Asn Leu Asp 
195 200 205 

Pro Asn lie Phe lie Val Thr Tyr Thr Gly Glu His Thr His Pro Arg 
210 215 220 

Pro Thr His Arg Asn Ser Leu Ala Gly Ser Thr Arg Asn Lys Ser Gin 
225 ~ 230 235 ~ 240 

Pro Val Asn Pro Val Pro Lys Pro Asp Thr Ser Pro Leu Ser Asp Thr 
245 250 255 



.Val Lys Glu Glu lie His Leu Ser Pro Thr Thr Pro Leu Lys Gly Asn 
260 265 270 

Asp Asp Val Gin Glu Thr Asn Gly Asp Glu Asp Met Val Gly Gin Glu 
275 280 285 

Val Asn Met Glu Glu Glu Glu Glu Glu Glu Glu Val Glu Glu Asp Asp 
290 295 300 

Glu Glu Glu Glu Asp Asp Asp Asp Val Asp Asp Leu Leu lie Pro Asn 

305 ' 310 " 315 320 

Leu Ala Val Arg Asp Arg Asp Asp Leu Phe Phe Ala Gly Ser Phe Pro 
325 330 335 
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Met Phe Arg Ser Asp Lys Ala Glu Lys Met Asp Lys Arg Arg Arg Arg 
1 5 10 15 



Gin Ser Lys Ala Lys Ala Ser Cys Ser Glu Glu Val Ser Ser lie Glu 
20 25 30 



Trp Glu Ala Val Lys Met Ser Glu Glu Glu Glu Asp Leu He Ser Arg 
35 40 45 



Met Tyr Lys Leu Val Gly Asp Arg Trp Glu Leu He Ala Gly Arg He 
50 55 60 



Pro Gly Arg Thr Pro Glu Glu He Glu Arg Tyr Trp Leu Met Lys His 
65 70 75 80 



Gly Val Val Phe Ala Asn Arg Arg Arg Asp Phe Phe Arg Lys 
85 90 



<210> 7 
<211>407 
<212> DNA 

<213> Arabidopsis thaliana 
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ccagtagtt atg gat aat acc aac cgt ctt cgt ctt cgt cgc ggt ccc agt 51 
Met Asp Asn Thr Asn Arg Leu Arg Leu Arg Arg Gly Pro Ser 
15 10 

ctt agg caa act aag ttc act cga tec cga tat gac tct gaa gaa gtg 99 
Leu Arg Gin Thr Lys Phe Thr Arg Ser Arg Tyr Asp Ser Glu Glu Val 
15 20 25 30 



agt age ate gaa tgg gag ttt ate agt atg acc gaa caa gaa gaa gat 
Ser Ser lie Glu Trp Glu Phe He Ser Met Thr Glu Gin Glu Glu Asp 
35 40 45 



147 



etc ate tct cga atg tac aga ctt gtc ggt aat agg tgg gat tta ata 195 

Leu lie Ser Arg Met Tyr Arg Leu Val Gly Asn Arg Trp Asp Leu He 
50 55 60 

gca gga aga gtc gta gga aga aag gca aat gag att gag aga tac tgg 243 

Ala Gly Arg Val Val Gly Arg Lys Ala Asn Glu He Glu Arg Tyr Trp 

15 65 70 75 

att atg aga aac tct gac tat ttt tct cac aaa cga cga cgt ctt aat 291 

He Met Arg Asn Ser Asp Tyr Phe Ser His Lys Arg Arg Arg Leu Asn 

80 85 90 

20 aat tct ccc ttt ttt tct act tct cct ctt aat etc caa gaa aat eta 339 

Asn Ser Pro Phe Phe Ser Thr Ser Pro Leu Asn Leu Gin Glu Asn Leu 

95 100 105 110 

aaa ttg taa agaaatcaaa ataaaagctt tcaatcataa aagtagaaca 3 88 

Lys Leu 



aatcttgaat gtcttctca 407 



<210>8 
<211> 112 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 8 



Met Asp Asn Thr Asn Arg Leu Arg Leu Arg Arg Gly Pro Ser Leu Arg 
1 5 10 15 

Gin Thr Lys Phe Thr Arg Ser Arg Tyr Asp Ser Glu Glu Val Ser Ser 
20 25 30 

He Glu Trp Glu Phe He Ser Met Thr Glu Gin Glu Glu Asp Leu He 
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35 40 45 



Ser Arg Met Tyr Arg Leu Val Gly Asn Arg Trp Asp Leu lie Ala Gly 
50 55 60 



Arg Val Val Gly Arg Lys Ala Asn Glu He Glu Arg Tyr Trp He Met 
65 70 75 80 



Arg Asn Ser Asp Tyr Phe Ser His Lys Arg Arg Arg Leu Asn Asn Ser 
85 90 95 



Pro Phe Phe Ser Thr Ser Pro Leu Asn Leu Gin Glu Asn Leu Lys Leu 
100 105 110 



<210> 9 
<211> 1547 
<212> DNA 
<213> Arabidopsis 

<220> 
<221> CDS 
<222> (31 2). .(1310) 
<223> G256 

<400> 9 
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tcgtgagcgt 


tgtgtttctc ctcaacattc 


aaagtcttta 


gtgaaacctc tcttgtaaga 


60 




agccaaaaaa 


ataaagagaa agattcaaag 


aaggaaagaa 


attgaggatg actatttcaa 


120 


5 


gtccaaagag 


agattttgag tagaccctct 


tcacaaaaat 


ccaatcttag agtcttacta 


180 




gttactatct 


agcttacata cacagagaca 


ctataccaaa 


aatccaatct tattagagta 


24 0 




cttactatat 


agcttacaca tacacacaca 


cgaagtacta 


tttcaacgat caagagcgtg 


300 


10 


tgcgtgagga 


t atg ggt aga cca cct 
Met Gly Arg Pro Pro 


tgt tgc gag aag att gag gtg aag 
Cys Cys Glu Lys lie Glu Val Lys 


350 



1 5 10 

aaa gga cca tgg act ccc gaa gaa gac ata ate ttg gtc tct tat ate 398 

Lys Gly Pro Trp Thr Pro Glu Glu Asp lie lie Leu Val Ser Tyr lie 

15 20 25 

15 

caa caa cac ggc cct gga aat tgg aga tct gtc cct gca aac acc ggt 446 

Gin Gin His Gly Pro Gly Asn Trp Arg Ser Val Pro Ala Asn Thr Gly 
30 35 40 45 

ttg eta agg tgt age aag agt tgc aga ctt aga tgg act aat tac ctt 494 
Leu Leu Arg Cys Ser Lys Ser Cys Arg Leu Arg Trp Thr Asn Tyr Leu 
20 " 50 55 60 

cgt ccc ggg ate aaa cga gga aat ttc act caa ccg gaa gag aag atg 542 
Arg Pro Gly lie Lys Arg Gly Asn Phe Thr Gin Pro Glu Glu Lys Met 
65 ~ 70 75 
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ate ate cac ctt caa get ctt ttg gga aat aga tgg gca get ata gca 590 
lie He His Leu Gin Ala Leu Leu Gly Asn Arg Trp Ala Ala He Ala 
80 85 90 

tea tat eta cct cag agg acc gac aat gat ate aag aac tac tgg aac 63 8 

Ser Tyr Leu Pro Gin Arg Thr Asp Asn Asp He Lys Asn Tyr Trp Asn 
95 100 105 

act cat ctt aaa aag aaa eta gtg atg atg aag ttt caa aat ggt ate 686 
Thr His Leu Lys Lys Lys Leu Val Met Met Lys Phe Gin Asn Gly He 
110 * 115 120 125 

ate aac gaa aac aaa acc aat ctg gca aca gat att teg tct tgt aat 734 
He Asn Glu Asn Lys Thr Asn Leu Ala Thr Asp He Ser Ser Cys Asn 



34 



EP 1 230 345 B1 



130 135 140 

aat aac aac aat gga tgt aat cac aac aaa agg acc acc aac aaa ggc 782 

Asn Asn Asn Asn Gly Cys Asn His Asn Lys Arg Thr Thr Asn Lys Gly 
5 145 150 155 

caa tgg gag aaa aaa ctt caa aca gac ate aac atg gec aaa caa gec 83 0 

Gin Trp Glu Lys Lys Leu Gin Thr Asp lie Asn Met Ala Lys Gin Ala 
160 165 170 

io tta ttc caa gec ttg tea ctt gac caa cca tct tea ttg ate cct ccc 878 

Leu Phe Gin Ala Leu Ser Leu Asp Gin Pro Ser Ser Leu lie Pro Pro 

175 180 185 

gat cct gac tea cca aaa cct cat cat cat tct acc acc act tat gee 926 

Asp Pro Asp Ser Pro Lys Pro His His His Ser Thr Thr Thr Tyr Ala 

190 195 200 205 

15 

tea age aca gat aac ate tct aaa tta etc cag aac tgg aca age tea 974 

Ser Ser Thr Asp Asn lie Ser Lys Leu Leu Gin Asn Trp Thr Ser Ser 
210 215 220 

tea teg tea aag cct aac act tea tea gtc tec aac aac egg age tea 1022 

20 Ser Ser Ser Lys Pro Asn Thr Ser Ser Val Ser Asn Asn Arg Ser Ser 

225 230 235 

age ccc ggt gaa gga gga ctt ttt gat cat cac tct ttg ttc tea teg 1070 

Ser Pro Gly Glu Gly Gly Leu Phe Asp His His Ser Leu Phe Ser Ser 
240 245 250 

25 aat tea gaa tct gga tea gtt gat gag aag ctg aat ttg atg tec gag 1118 

Asn Ser Glu Ser Gly Ser Val Asp Glu Lys Leu Asn Leu Met Ser Glu 

255 260 265 

aca age atg ttc aaa ggt gag age aag cca gac ata gac atg gaa get 1166 

Thr Ser Met Phe Lys Gly Glu Ser Lys Pro Asp lie Asp Met Glu Ala 

30 270 275 280 285 

aca cct act act act act act act act gat gat caa ggc teg ttg tea 1214 

Thr Pro Thr Thr Thr Thr Thr Thr Thr Asp Asp Gin Gly Ser Leu Ser 
290 295 300 

ttg ate gag aaa tgg ttg ttt gat gat caa ggc ttg gtt cag tgt gat 1262 

35 Leu He Glu Lys Trp Leu Phe Asp Asp Gin Gly Leu Val Gin Cys Asp 

305 310 315 

gat agt caa gaa gat etc ate gac gtg tct tta gag gag tta aaa taa 1310 

Asp Ser Gin Glu Asp Leu He Asp Val Ser Leu Glu Glu Leu Lys 
320 325 330 

40 



45 



tgataacaac 


agtcaagatt 


tgttctataa 


gaaaataaaa 


cgtatagaac 


aacgataaag 


1370 


ctagctaggt 


ttattaattt 


ttctttcttt 


tgtcttttct 


ctatgatctt 


tagttacatt 


1430 


ttattttact 


gtgtggcttg 


cttgtggtca 


agtcgatgaa 


gatcaaactg 


tgatatacta 


1490 


tttatatgta 


aagtactata 


aagttaagag 


tagttgaata 


aaaaaaaaaa 


aaaaaaa 


1547 
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Met Gly Arg Pro Pro Cys Cys Glu Lys lie Glu Val Lys Lys Gly Pro 
1 ' 5 10 15 

Trp Thr Pro Glu Glu Asp lie He Leu Val Ser Tyr He Gin Gin His 
20 25 30 

Gly Pro Gly Asn Trp Arg Ser Val Pro Ala Asn Thr Gly Leu Leu Arg 
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35 40 45 

Cys Ser Lys Ser Cys Arg Leu Arg Trp Thr Asn Tyr Leu Arg Pro Gly 
50 55 60 

lie Lys Arg Gly Asn Phe Thr Gin Pro Glu Glu Lys Met lie lie His 
65 ~ 70 75 80 

Leu Gin Ala Leu Leu Gly Asn Arg Trp Ala Ala lie Ala Ser Tyr Leu 
85 90 95 

Pro Gin Arg Thr Asp Asn Asp lie Lys Asn Tyr Trp Asn Thr His Leu 
100 105 110 

Lys Lys Lys Leu Val Met Met Lys Phe Gin Asn Gly lie lie Asn Glu 
115 120 125 

Asn Lys Thr Asn Leu Ala Thr Asp He Ser Ser Cys Asn Asn Asn Asn 
130 135 140 

Asn Gly Cys Asn His Asn Lys Arg Thr Thr Asn Lys Gly Gin Trp Glu 
145 150 155 160 

Lys Lys Leu Gin Thr Asp He Asn Met Ala Lys Gin Ala Leu Phe Gin 
165 170 175 

Ala Leu Ser Leu Asp Gin Pro Ser Ser Leu He Pro Pro Asp Pro Asp 
180 185 190 

Ser Pro Lys Pro His His His Ser Thr Thr Thr Tyr Ala Ser Ser Thr 
195 200 205 

Asp Asn He Ser Lys Leu Leu Gin Asn Trp Thr Ser Ser Ser Ser Ser 
210 215 220 

Lys Pro Asn Thr Ser Ser Val Ser Asn Asn Arg Ser Ser Ser Pro Gly 
225 230 235 240 

Glu Gly Gly Leu Phe Asp His His Ser Leu Phe Ser Ser Asn Ser Glu 
245 250 255 

Ser Gly Ser Val Asp Glu Lys Leu Asn Leu Met Ser Glu Thr Ser Met 
260 265 270 

Phe Lys Gly Glu Ser Lys Pro Asp He Asp Met Glu Ala Thr Pro Thr 
275 280 285 

Thr Thr Thr Thr Thr Thr Asp Asp Gin Gly Ser Leu Ser Leu He Glu 
290 295 300 

Lys Trp Leu Phe Asp Asp Gin Gly Leu Val Gin Cys Asp Asp Ser Gin 
305 310 315 320 

Glu Asp Leu He Asp Val Ser Leu Glu Glu Leu Lys 
325 330 
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<211>2405 
<212> DNA 

<213> Arabidopsis thaliana 

5 
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dayCCaCdCa 


accccctt tc 


LLLLLLC.LLL 




♦*r , t~r , t"t"rTat" 

LLL\- LLLLy L. 


L. \, OCX LLLLL U 


60 


LdLLLLLLUL 


s-isy i- /"^ f» a t~ t" 
LyLLUaLLLL 




L L Ll» t-C LU. 


ILLL LLLLLfl 




120 


aagaagaaaa 


ataattcaca 


tctttatgca 


aactactttc 


ttgtagggtt 


ttaggagcta 


180 


tctctattgt 


cttggttctg 


atacaaagtt 


ttgtaatttt 


catggtatga 


gaagatttgc 


240 


ctttctattt 


tgtttattgg 


ttctttttaa 


ctttttcttg 


gagatgggtt 


cttgtagatc 


300 


ttaatgaaac 


ttctgttttt 


gtcccaaaaa 


gagttttctt 


ttttcttctc 


ttctttttgg 


360 


gttttcaatt 


cttgagagac 


atg gca aga gac cag ttc tat ggt cac aat aac 
Met Ala Arg Asp Gin Phe Tyr Gly His Asn Asn 
15 10 


413 



10 



cat cat cat caa gag caa caa cat caa atg att aat cag ate caa ggg 461 

His His His Gin Glu Gin Gin His Gin Met lie Asn Gin He Gin Gly 
15 20 25 

ttt gat gag aca aac caa aac cca acc gat cat cat cat tac aat cat 509 

Phe Asp Glu Thr Asn Gin Asn Pro Thr Asp His His His Tyr Asn His 

30 35 40 

cag ate ttt ggc tea aac tec aac atg ggt atg atg ata gac ttc tct 557 

Gin He Phe Gly Ser Asn Ser Asn Met Gly Met Met He Asp Phe Ser 
45 50 55 

aag caa caa cag att agg atg aca agt ggt teg gat cat cat cat cat 605 

Lys Gin Gin Gin He Arg Met Thr Ser Gly Ser Asp His His His His 

60 65 70 75 

cat cat cag aca agt ggt ggt act gat cag aat cag ctt ctg gaa gat 653 

His His Gin Thr Ser Gly Gly Thr Asp Gin Asn Gin Leu Leu Glu Asp 
80 85 90 

tct tea tct gee atg aga eta tgc aat gtt aat aat gat ttc cca agt 701 

Ser Ser Ser Ala Met Arg Leu Cys Asn Val Asn Asn Asp Phe Pro Ser 

30 95 100 105 

gaa gta aat gat gag aga cca cca caa aga cca age caa ggt ctt tec 74 9 

Glu Val Asn Asp Glu Arg Pro Pro Gin Arg Pro Ser Gin Gly Leu Ser 

110 115 120 

35 ctt tct etc tec tct tea aat cct aca age ate agt etc caa tct ttc 797 

Leu Ser Leu Ser Ser Ser Asn Pro Thr Ser He Ser Leu Gin Ser Phe 
125 130 135 

gaa etc aga ccc caa caa caa caa caa ggg tat tec ggt aat aaa tea 845 

Glu Leu Arg Pro Gin Gin Gin Gin Gin Gly Tyr Ser Gly Asn Lys Ser 

140 145 150 155 



aca caa cat cag aat etc caa cac acg cag atg atg atg atg atg atg 893 
Thr Gin His Gin Asn Leu Gin His Thr Gin Met Met Met Met Met Met 
160 165 170 

aat agt cac cac caa aac aac aac aat aac aat cat cag cat cat aat 941 
Asn Ser His His Gin Asn Asn Asn Asn Asn Asn His Gin His His Asn 
175 180 185 

cat cat cag ttt cag att ggg agt tec aag tat ttg agt cca get caa 989 
His His Gin Phe Gin He Gly Ser Ser Lys Tyr Leu Ser Pro Ala Gin 
190 195 200 
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gag eta ctg agt gag ttt tgc agt ctt gga gta aag gaa age gat gaa 103 7 

Glu Leu Leu Ser Glu Phe Cys Ser Leu Gly Val Lye Glu Ser Asp Glu 

205 210 * 215 

gaa gtg atg atg atg aag cat aag aag aag caa aag ggt aaa caa caa 1085 
Glu Val Met Met Met Lys His Lys Lys Lys Gin Lys Gly Lys Gin Gin 
220 225 230 235 

gaa gag tgg gac aca agt cac cac age aac aat gat caa cat gac caa 1133 
Glu Glu Trp Asp Thr Ser His His Ser Asn Asn Asp Gin His Asp Gin 
240 245 250 

tct gcg act act tct tea aag aaa cat gtt cca cca ctt cac tct ctt 1181 
Ser Ala Thr Thr Ser Ser Lys Lys His Val Pro Pro Leu His Ser Leu 
255 260 265 

gag ttc atg gaa ctt cag aaa aga aaa gec aag ttg etc tec atg etc 122 9 

Glu Phe Met Glu Leu Gin Lys Arg Lys Ala Lys Leu Leu Ser Met Leu 
270 275 280 

gaa gag ctt aaa aga aga tat gga cat tac cga gag caa atg aga gtt 1277 
Glu Glu Leu Lys Arg Arg Tyr Gly His Tyr Arg Glu Gin Met Arg Val 
285 290 295 

gcg gcg gca gec ttt gaa gcg gcg gtt gga eta gga ggg gca gag ata 1325 
Ala Ala Ala Ala Phe Glu Ala Ala Val Gly Leu Gly Gly Ala Glu He 
300 305 310 315 

tac act gcg tta gcg tea agg gca atg tea aga cac ttt egg tgt tta 1373 
Tyr Thr Ala Leu Ala Ser Arg Ala Met Ser Arg His Phe Arg Cys Leu 
320 325 330 

aaa gac gga ctt gtg gga cag att caa gca aca agt caa get ttg gga 1421 
Lys Asp Gly Leu Val Gly Gin He Gin Ala Thr Ser Gin Ala Leu Gly 
335 340 345 

gag aga gaa gag gat aat cgt gcg gtt tct att gca gca cgt gga gaa 1469 
Glu Arg Glu Glu Asp Asn Arg Ala Val Ser He Ala Ala Arg Gly Glu 

30 350 355 360 

act cca egg ttg aga ttg etc gat caa get ttg egg caa cag aaa teg 1517 
Thr Pro Arg Leu Arg Leu Leu Asp Gin Ala Leu Arg Gin Gin Lys Ser 
365 ~ 370 375 

tat cgc caa atg act ctt gtt gac get cat cct tgg cgt cca caa cgc 1565 
35 Tyr Arg Gin Met Thr Leu Val Asp Ala His Pro Trp Arg Pro Gin Arg 

380 385 390 395 

ggc ttg cct gaa cgc gca gtc aca acg ttg aga get tgg etc ttt gaa 1613 
Gly Leu Pro Glu Arg Ala Val Thr Thr Leu Arg Ala Trp Leu Phe Glu 
400 405 410 

40 cac ttt ctt cac cca tat ccg age gat gtt gat aag cat ata ttg gec 1661 

His Phe Leu His Pro Tyr Pro Ser Asp Val Asp Lys His He Leu Ala 
415 420 425 

cga caa act ggt tta tea aga agt cag gta tea aat tgg ttt att aat 1709 
Arg Gin Thr Gly Leu Ser Arg Ser Gin Val Ser Asn Trp Phe He Asn 
45 430 435 440 

gca aga gtt agg eta tgg aaa cca atg att gaa gaa atg tac tgt gaa 1757 
Ala Arg Val Arg Leu Trp Lys Pro Met lie Glu Glu Met Tyr Cys Glu 
445 450 455 

gaa aca aga agt gaa caa atg gag att aca aac ccg atg atg ate gat 1805 
50 Glu Thr Arg Ser Glu Gin Met Glu He Thr Asn Pro Met Met lie Asp 

460 465 470 475 

act aaa ccg gac ccg gac cag ttg ate cgt gtc gaa ccg gaa tct tta 1853 
Thr Lys Pro Asp Pro Asp Gin Leu He Arg Val Glu Pro Glu Ser Leu 
480 485 490 

55 tec tea ata gtg aca aac cct aca tec aaa tec ggt cac aac tea acc 1901 

Ser Ser lie Val Thr Asn Pro Thr Ser Lys Ser Gly His Asn Ser Thr 
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10 



495 500 505 

cat gga acg atg teg tta ggg tea acg ttt gac ttt tec ttg tac ggt 1949 
His Gly Thr Met Ser Leu Gly Ser Thr Phe Asp Phe Ser Leu Tyr Gly 
510 515 520 

aac caa get gtg aca tac get ggt gaa gga ggg cca cgt ggt gac gtt 1997 
Asn Gin Ala Val Thr Tyr Ala Gly Glu Gly Gly Pro Arg Gly Asp Val 
525 530 535 

tec ttg acg ctt ggg tta caa cgt aac gat ggt aac ggt ggt gtg agt 2045 
Ser Leu Thr Leu Gly Leu Gin Arg Asn Asp Gly Asn Gly Gly Val Ser 
540 545 550 555 

tta gcg ttg tct cca gtg acg get caa ggt ggc caa ctt ttc tac ggt 2093 
Leu Ala Leu Ser Pro Val Thr Ala Gin Gly Gly Gin Leu Phe Tyr Gly 
560 565 570 

aga gac cac att gaa gaa gga ccg gtt caa tat tea gcg teg atg tta 2141 
Arg Asp His lie Glu Glu Gly Pro Val Gin Tyr Ser Ala Ser Met Leu 
575 580 585 

gat gat gat caa gtt cag aat ttg cct tat agg aat ttg atg gga get 2189 
Asp Asp Asp Gin Val Gin Asn Leu Pro Tyr Arg Asn Leu Met Gly Ala 
590 595 600 

caa tta ctt cat gat att gtt tga gattaaaaga ttaggaccaa agttatcgat 2243 
Gin Leu Leu His Asp lie Val 
605 610 

25 acatattttc caaaaccgat tcggttatgt aacggtttag ttagataaaa accaaattag 2303 

atatttatat ataccgttgt ctgattggat tggaggattg gtggacaagg agatattatt 2363 

aatgtatgag ttagttggtt cgtcaaaaaa aaaaaaaaaa aa 2405 



<210> 12 
<211> 610 
<212> PRT 

<213> Arabidopsis thaliana 
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Met Ala Arg Asp Gin Phe Tyr Gly His Asn Asn His His His Gin Glu 
15 10 15 

Gin Gin His Gin Met lie Asn Gin lie Gin Gly Phe Asp Glu Thr Asn 
20 25 30 

Gin Asn Pro Thr Asp His His His Tyr Asn His Gin He Phe Gly Ser 
35 40 45 

Asn Ser Asn Met Gly Met Met He Asp Phe Ser Lys Gin Gin Gin He 
50 55 60 

Arg Met Thr Ser Gly Ser Asp His His His His His His Gin Thr Ser 
65 70 75 80 

Gly Gly Thr Asp Gin Asn Gin Leu Leu Glu Asp Ser Ser Ser Ala Met 
85 90 95 

Arg Leu Cys Asn Val Asn Asn Asp Phe Pro Ser Glu Val Asn Asp Glu 
100 105 110 

Arg Pro Pro Gin Arg Pro Ser Gin Gly Leu Ser Leu Ser Leu Ser Ser 
115 120 125 
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Ser Asn Pro Thr Ser lie Ser Leu Gin Ser Phe Glu Leu Arg Pro Gin 
130 135 140 

Gin Gin Gin Gin Gly Tyr Ser Gly Asn Lys Ser Thr Gin His Gin Asn 
145 150 155 160 

Leu Gin His Thr Gin Met Met Met Met Met Met Asn Ser His His Gin 
165 170 175 

Asn Asn Asn Asn Asn Asn His Gin His His Asn His His Gin Phe Gin 
180 185 190 

lie Gly Ser Ser Lys Tyr Leu Ser Pro Ala Gin Glu Leu Leu Ser Glu 
195 200 205 

Phe Cys Ser Leu Gly Val Lys Glu Ser Asp Glu Glu Val Met Met Met 
210 215 220 

Lys His Lys Lys Lys Gin Lys Gly Lys Gin Gin Glu Glu Trp Asp Thr 
225 230 235 240 

Ser His His Ser Asn Asn Asp Gin His Asp Gin Ser Ala Thr Thr Ser 
245 250 255 

Ser Lys Lys His Val Pro Pro Leu His Ser Leu Glu Phe Met Glu Leu 
260 265 270 

Gin Lys Arg Lys Ala Lys Leu Leu Ser Met Leu Glu Glu Leu Lys Arg 
275 280 285 

Arg Tyr Gly His Tyr Arg Glu Gin Met Arg Val Ala Ala Ala Ala Phe 
290 295 300 

Glu Ala Ala Val Gly Leu Gly Gly Ala Glu He Tyr Thr Ala Leu Ala 
305 310 315 320 

Ser Arg Ala Met Ser Arg His Phe Arg Cys Leu Lys Asp Gly Leu Val 
325 330 335 

Gly Gin He Gin Ala Thr Ser Gin Ala Leu Gly Glu Arg Glu Glu Asp 
340 345 350 

Asn Arg Ala Val Ser He Ala Ala Arg Gly Glu Thr Pro Arg Leu Arg 
355 360 365 

Leu Leu Asp Gin Ala Leu Arg Gin Gin Lys Ser Tyr Arg Gin Met Thr 
370 375 380 

Leu Val Asp Ala His Pro Trp Arg Pro Gin Arg Gly Leu Pro Glu Arg 
385 390 395 400 

Ala Val Thr Thr Leu Arg Ala Trp Leu Phe Glu His Phe Leu His Pro 

405 410 415 

Tyr Pro Ser Asp Val Asp Lys His He Leu Ala Arg Gin Thr Gly Leu 
420 425 430 



43 



EP 1 230 345 B1 



Ser Arg Ser Gin Val Ser Asn Trp Phe lie Asn Ala Arg Val Arg Leu 
435 440 445 



Trp Lys Pro Met He Glu Glu Met Tyr Cys Glu Glu Thr Arg Ser Glu 
450 455 460 



Gin Met Glu He Thr Asn Pro Met Met He Asp Thr Lys Pro Asp Pro 
465 470 475 480 



Asp Gin Leu He Arg Val Glu Pro Glu Ser Leu Ser Ser He Val Thr 
485 490 495 



Asn Pro Thr Ser Lys Ser Gly His Asn Ser Thr His Gly Thr Met Ser 
500 505 510 



Leu Gly Ser Thr Phe Asp Phe Ser Leu Tyr Gly Asn Gin Ala Val Thr 
515 520 525 



Tyr Ala Gly Glu Gly Gly Pro Arg Gly Asp Val Ser Leu Thr Leu Gly 
530 535 540 



Leu Gin Arg Asn Asp Gly Asn Gly Gly Val Ser Leu Ala Leu Ser Pro 
545 550 * 555 560 



Val Thr Ala Gin Gly Gly Gin Leu Phe Tyr Gly Arg Asp His He Glu 
565 570 575 



Glu Gly Pro Val Gin Tyr Ser Ala Ser Met Leu Asp Asp Asp Gin Val 
580 585 590 



Gin Asn Leu Pro Tyr Arg Asn Leu Met Gly Ala Gin Leu Leu His Asp 
595 " 600 605 



He Val 
610 

<210> 13 
<211> 989 
<212> DNA 

<213> Arabidopsis thaliana 

<220> 
<221> CDS 
<222> (41). .(664) 
<223> G464 

<400> 13 
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<212> PRT 

<213> Arabidopsis thaliana 
<400> 14 



Met Arg Gly Val Ser Glu Leu Glu Val Gly Lys Ser Asn Leu Pro Ala 
15 10 15 

Glu Ser Glu Leu Glu Leu Gly Leu Gly Leu Ser Leu Gly Gly Gly Ala 
20 25 30 

Trp Lys Glu Arg Gly Arg He Leu Thr Ala Lys Asp Phe Pro Ser Val 
35 ' " 40 45 



Gly Ser Lys Arg Ser Ala Glu Ser Ser Ser His Gin Gly Ala Ser Pro 
50 "* 55 60 

Pro Arg Ser Ser Gin Val Val Gly Trp Pro Pro He Gly Leu His Arg 
65 70 75 80 

Met Asn Ser Leu Val Asn Asn Gin Ala Met Lys Ala Ala Arg Ala Glu 
85 90 95 

Glu Gly Asp Gly Glu Lys Lys Val Val Lys Asn Gly Glu Leu Lys Asp 
100 " 105 110 

Val Ser Met Lys Val Asn Pro Lys Val Gin Gly Leu Gly Phe Val Lys 
115 120 125 

Val Asn Met Asp Gly Val Gly He Gly Arg Lys Val Asp Met Arg Ala 
130 135 140 

His Ser Ser Tyr Glu Asn Leu Ala Gin Thr Leu Glu Glu Met Phe Phe 
145 150 155 160 

Gly Met Thr Gly Thr Thr Cys Arg Glu Thr Val Lys Pro Leu Arg Leu 
165 ' ~ 170 175 

Leu Asp Gly Ser Ser Asp Phe Val Leu Thr Tyr Glu Asp Lys Gly He 
180 185 190 

Gly Cys Leu Leu Glu Met Phe His Gly Glu Cys Leu Ser Thr Arg 
195 200 205 



<210> 15 
<211> 1065 
<212> DNA 

<213> Arabidopsis thaliana 

<220> 
<221> CDS 
<222> (188). .(760) 
<223> G482 

<400> 15 
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tcgacccacg cgtccggaca cttaacaatt cacaccttct ctttttactc ttcctaaaac 60 

cctaaatttc ctcgcttcag tcttcccact caagtcaacc accaattgaa ttcgatttcg 120 

5 aatcattgat ggaaatgatt tgaaaaaaga gtaaagttta tttttttatt ccttgtaatt 180 

ttcagaa atg ggg gat tec gac agg gat tec ggt gga ggg caa aac ggg 229 
Met Gly Asp Ser Asp Arg Asp Ser Gly Gly Gly Gin Asn Gly 
15 10 

aac aac cag aac gga cag tec tec ttg tct cca aga gag caa gac agg 2 77 

10 Asn Asn Gin Asn Gly Gin Ser Ser Leu Ser Pro Arg Glu Gin Asp Arg 

15 20 25 30 

ttc ttg ccg ate get aac gtc age egg ate atg aag aag gee ttg ccc 325 
Phe Leu Pro lie Ala Asn Val Ser Arg lie Met Lys Lys Ala Leu Pro 
35 40 45 

gec aac gee aag ate tct aaa gat gee aaa gag acg atg cag gag tgt 3 73 

Ala Asn Ala Lys lie Ser Lys Asp Ala Lys Glu Thr Met Gin Glu Cys 
50 55 60 



20 



40 



45 



50 



55 



gtc tec gag ttc ate age ttc gtc acc gga gaa gca tct gat aag tgt 421 



Val Ser Glu Phe lie Ser Phe Val Thr Gly Glu Ala Ser Asp Lys Cys 
65 70 ' 75 

cag aag gag aag agg aag acg ate aac gga gac gat ttg etc tgg get 469 
25 Gin Lys Glu Lys Arg Lys Thr lie Asn Gly Asp Asp Leu Leu Trp Ala 

80 85 90 

atg act act eta ggt ttt gag gat tat gtt gag cca ttg aaa gtt tac 517 

Met Thr Thr Leu Gly Phe Glu Asp Tyr Val Glu Pro Leu Lys Val Tyr 

95 100 105 110 

30 

ttg cag agg ttt agg gag ate gaa ggg gag agg act gga eta ggg agg 565 

Leu Gin Arg Phe Arg Glu lie Glu Gly Glu Arg Thr Gly Leu Gly Arg 

115 120 125 

cca cag act ggt ggt gag gtc gga gag cat cag aga gat get gtc gga 613 
Pro Gin Thr Gly Gly Glu Val Gly Glu His Gin Arg Asp Ala Val Gly 
35 1 30 1 35 1 40 

gat ggc ggt ggg ttc tac ggt ggt ggt ggt ggg atg cag tat cac caa 661 
Asp Gly Gly Gly Phe Tyr Gly Gly Gly Gly Gly Met Gin Tyr His Gin 
145 150 155 



cat cat cag ttt ctt cac cag cag aac cat atg tat gga gee aca ggt 709 
His His Gin Phe Leu His Gin Gin Asn His Met Tyr Gly Ala Thr Gly 
160 165 170 

ggc ggt age gac agt gga ggt gga get gee tec ggt agg aca agg act 757 
Gly Gly Ser Asp Ser Gly Gly Gly Ala Ala Ser Gly Arg Thr Arg Thr 
175 180 185 190 

taa caaagattgg tgaagtggat ctctctctgt atatagatac ataaatacat 810 

gtatacacat gectattttt acgacccata taaggtatct atcatgtgat agaacgaaca 870 

ttggtgttgg tgatgtaaaa tcagatgtgc attaagggtt tagattttga ggctgtgtaa 930 

aagaagatca agtgtgcttt gttggacaat aggattcact aacgaatctg cttcattgga 990 

tcttgtatgt aactaaagee attgtattga atgcaaatgt tttcatttgg gatgetttaa 1050 

aaaaaaaaaa aaaaa 1065 



<210> 16 
<211> 190 
<212> PRT 



47 



EP 1 230 345 B1 



<213> Arabidopsis thaliana 
<400> 16 



Met Gly Asp Ser Asp Arg Asp Ser Gly Gly Gly Gin Asn Gly Asn Asn 
15 10 15 

Gin Asn Gly Gin Ser Ser Leu Ser Pro Arg Glu Gin Asp Arg Phe Leu 
20 25 ~ 30 

Pro lie Ala Asn Val Ser Arg lie Met Lys Lys Ala Leu Pro Ala Asn 
35 40 45 

Ala Lys lie Ser Lys Asp Ala Lys Glu Thr Met Gin Glu Cys Val Ser 
50 55 60 

Glu Phe He Ser Phe Val Thr Gly Glu Ala Ser Asp Lys Cys Gin Lys 
65 70 75 80 

Glu Lys Arg Lys Thr He Asn Gly Asp Asp Leu Leu Trp Ala Met Thr 
85 90 *" 95 



Thr Leu Gly Phe Glu Asp Tyr Val Glu Pro Leu Lys Val Tyr Leu Gin 
100 105 110 

Arg Phe Arg Glu He Glu Gly Glu Arg Thr Gly Leu Gly Arg Pro Gin 

115 120 * 125 

Thr Gly Gly Glu Val Gly Glu His Gin Arg Asp Ala Val Gly Asp Gly 
130 135 140 

Gly Gly Phe Tyr Gly Gly Gly Gly Gly Met Gin Tyr His Gin His His 
145 150 155 160 

Gin Phe Leu His Gin Gin Asn His Met Tyr Gly Ala Thr Gly Gly Gly 
165 170 175 

Ser Asp Ser Gly Gly Gly Ala Ala Ser Gly Arg Thr Arg Thr 
180 185 190 



<210> 17 
<211> 1409 
<212> DNA 

<213> Arabidopsis thaliana 

<220> 
<221>CDS 
<222> (224). .(1093) 
<223> G502 

<400> 1 7 
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ttgatgccgc tcaatcccac tatccttcgc aaggaccctt cctctatata aggaagttca 60 
tttcatttgg agaggacacg ctgacaagct gactctagca gatctgggac cgtcgaccca 120 
cgcgtccgaa ttgattagga taggatcagg atcatcctca acaacctcct cctaattcct 180 

5 

cctccattca tagtaacaat aatattaaga aagagggtaa act atg tea gaa tta 235 

Met Ser Glu Leu 
1 

tta cag ttg cct cca ggt ttc cga ttt cac cct acc gat gaa gag ctt 283 
10 Leu Gin Leu Pro Pro Gly Phe Arg Phe His Pro Thr Asp Glu Glu Leu 

5 10 15 20 

gtc atg cac tat etc tgc cgc aaa tgt gec tct cag tec ate gee gtt 331 
Val Met His Tyr Leu Cys Arg Lys Cys Ala Ser Gin Ser lie Ala Val 
25 30 35 

15 ccg ate ate get gag ate gat etc tac aaa tac gat cca tgg gag ctt 379 

Pro He He Ala Glu He Asp Leu Tyr Lys Tyr Asp Pro Trp Glu Leu 
40 45 50 

cct ggt tta gee ttg tat ggt gag aag gaa tgg tac ttc ttc tct ccc 427 
Pro Gly Leu Ala Leu Tyr Gly Glu Lys Glu Trp Tyr Phe Phe Ser Pro 
2Q 55 60 65 

agg gac aga aaa tat ccc aac ggt teg cgt cct aac egg tec get ggt 475 
Arg Asp Arg Lys Tyr Pro Asn Gly Ser Arg Pro Asn Arg Ser Ala Gly 
70 75 80 

tct ggt tac tgg aaa get acc gga get gat aaa ccg ate gga eta cct 523 
25 ser Gly Tyr Trp Lys Ala Thr Gly Ala Asp Lys Pro He Gly Leu Pro 

85 90 95 100 

aaa ccg gtc gga att aag aaa get ctt gtt ttc tac gee ggc aaa get 571 
Lys Pro Val Gly lie Lys Lys Ala Leu Val Phe Tyr Ala Gly Lys Ala 
105 110 115 

30 

cca aag gga gag aaa acc aat tgg ate atg cac gag tac cgt etc gec 619 



35 



40 



45 



50 



55 
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10 



20 



25 



Pro Lys Gly Glu Lys Thr Asn Trp lie Met His Glu Tyr Arg Leu Ala 
120 125 130 

gac gtt gac egg tec gtt cgc aag aag aag aat agt etc agg ctg gat 667 
Asp Val Asp Arg Ser Val Arg Lys Lys Lys Asn Ser Leu Arg Leu Asp 
135 140 145 

gat tgg gtt etc tgc egg att tac aac aaa aaa gga get ace gag agg 715 
Asp Trp Val Leu Cys Arg lie Tyr Asn Lys Lys Gly Ala Thr Glu Arg 
150 155 160 

egg gga cca ccg cct ccg gtt gtt tac ggc gac gaa ate atg gag gag 763 
Arg Gly Pro Pro Pro Pro Val Val Tyr Gly Asp Glu He Met Glu Glu 
165 170 175 180 

aag ccg aag gtg acg gag atg gtt atg cct ccg ccg ccg caa cag aca 811 
Lys Pro Lys Val Thr Glu Met Val Met Pro Pro Pro Pro Gin Gin Thr 
185 190 195 

agt gag ttc gcg tat ttc gac acg teg gat teg gtg ccg aag ctg cat 859 
Ser Glu Phe Ala Tyr Phe Asp Thr Ser Asp Ser Val Pro Lys Leu His 
200 - * 205 210 

act acg gat teg agt tgc teg gag cag gtg gtg teg ccg gag ttc acg 907 
Thr Thr Asp Ser Ser Cys Ser Glu Gin Val Val Ser Pro Glu Phe Thr 
215 220 225 

age gag gtt cag age gag ccc aag tgg aaa gat tgg teg gee gta agt 955 
Ser Glu Val Gin Ser Glu Pro Lys Trp Lys Asp Trp Ser Ala Val Ser 
230 235 240 

aat gac aat aac aat ace ctt gat ttt ggg ttt aat tac att gat gee 1003 
Asn Asp Asn Asn Asn Thr Leu Asp Phe Gly Phe Asn Tyr He Asp Ala 
245 250 255 260 

ace gtg gat aac gcg ttt gga gga gga ggg agt agt aat cag atg ttt 1051 
Thr Val Asp Asn Ala Phe Gly Gly Gly Gly Ser Ser Asn Gin Met Phe 
265 270 275 

ccg eta cag gat atg ttc atg tac atg cag aag cct tac tag 1093 
Pro Leu Gin Asp Met Phe Met Tyr Met Gin Lys Pro Tyr 
280 285 



35 


aagggaattc 


ctttcctgcc 


gccgaaacgc 


aacgeaaaac gaccctcgtt 


tttgcgttta 


1153 




tggcaacacg 


agaccgtttt 


atatggtcaa 


tgagtgtgcc gatteggeca 


ttagatttct 


1213 




gttcagtctt 


cgtttattct 


atagacegtc 


cgatttcaga tcatccctaa 


teggaeggtg 


1273 


40 


gtcgttggat 


gtatcagtag 


tgtattactg 


tgttaggtag aagaaaatcc 


acttgttctt 


1333 




aaattggcat 


aaaagtcaga 


agctaatatt 


tatatgtgcc gcaatcaatt 


taatattttc 


1393 




tgtctaaaaa 


aaaaaa 








1409 



45 <210>18 
<211>289 



<212> PRT 

<213> Arabidopsis thaliana 



so <400>18 



55 



50 
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Met Ser Glu Leu Leu Gin Leu Pro Pro Gly Phe Arg Phe His Pro Thr 
15 10 15 

5 Asp Glu Glu Leu Val Met His Tyr Leu Cys Arg Lys Cys Ala Ser Gin 

20 25 30 

Ser He Ala Val Pro He He Ala Glu He Asp Leu Tyr Lys Tyr Asp 
35 40 45 

10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
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Pro Trp Glu Leu Pro Gly Leu Ala Leu Tyr Gly Glu Lys Glu Trp Tyr 
50 55 60 

Phe Phe Ser Pro Arg Asp Arg Lys Tyr Pro Asn Gly Ser Arg Pro Asn 
65 70 75 80 

Arg Ser Ala Gly Ser Gly Tyr Trp Lys Ala Thr Gly Ala Asp Lys Pro 
85 90 95 

lie Gly Leu Pro Lys Pro Val Gly lie Lys Lys Ala Leu Val Phe Tyr 
100 105 110 

Ala Gly Lys Ala Pro Lys Gly Glu Lys Thr Asn Trp lie Met His Glu 
115 120 125 

Tyr Arg Leu Ala Asp Val Asp Arg Ser Val Arg Lys Lys Lys Asn Ser 
130 *" 135 140 

Leu Arg Leu Asp Asp Trp Val Leu Cys Arg He Tyr Asn Lys Lys Gly 
145 150 155 160 

Ala Thr Glu Arg Arg Gly Pro Pro Pro Pro Val Val Tyr Gly Asp Glu 
165 170 175 

He Met Glu Glu Lys Pro Lys Val Thr Glu Met Val Met Pro Pro Pro 
180 " 185 190 

Pro Gin Gin Thr Ser Glu Phe Ala Tyr Phe Asp Thr Ser Asp Ser Val 
195 200 205 

Pro Lys Leu His Thr Thr Asp Ser Ser Cys Ser Glu Gin Val Val Ser 
210 215 220 

Pro Glu Phe Thr Ser Glu Val Gin Ser Glu Pro Lys Trp Lys Asp Trp 
225 230 235 240 

Ser Ala Val Ser Asn Asp Asn Asn Asn Thr Leu Asp Phe Gly Phe Asn 
245 250 255 

Tyr He Asp Ala Thr Val Asp Asn Ala Phe Gly Gly Gly Gly Ser Ser 
260 265 270 

Asn Gin Met Phe Pro Leu Gin Asp Met Phe Met Tyr Met Gin Lys Pro 
275 280 285 



Tyr 



<210> 19 
<211> 1481 
<212> DNA 

<213> Arabidopsis thaliana 

<220> 
<221>CDS 
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<222> (181). .(1188) 
<223> G526 

<400> 19 

5 
10 
15 
20 
25 
30 
35 
40 
45 
50 
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10 



20 



cgacccacgc gtccgagatt ctctcccagc tagctttctc aattcatttt tctttcttca 60 

tcttcttctt gtgtgatctc tctttccaaa taagcttatc attcttacaa aaatatttct 120 

gggtctctga tattgttctt gttctcttga atctttatta cttgaaaaac atataaagtg 1B0 

atg gcg gtt gtg gtt gaa gaa ggt gtg gtg ttg aat cat gga ggt gaa 228 
Met Ala Val Val Val Glu Glu Gly Val Val Leu Asn His Gly Gly Glu 
15 10 15 

gag ctt gtg gat ttg cca cct ggt ttc agg ttt cat cca aca gac gaa 276 
Glu Leu Val Asp Leu Pro Pro Gly Phe Arg Phe His Pro Thr Asp Glu 
20 25 30 

gag ate ata aca tgt tac ctt aag gag aag gtt tta aac age cga ttc 324 
Glu lie lie Thr Cys Tyr Leu Lys Glu Lys Val Leu Asn Ser Arg Phe 
35 40 45 

acg get gtg gec atg gga gaa get gat etc aac aag tgt gag cct tgg 3 72 

Thr Ala Val Ala Met Gly Glu Ala Asp Leu Asn Lys Cys Glu Pro Trp 
50 55 60 

gat ttg cca aag agg gca aag atg ggg gag aaa gag ttc tac ttc ttc 4 20 

Asp Leu Pro Lys Arg Ala Lys Met Gly Glu Lys Glu Phe Tyr Phe Phe 
65 70 75 80 

tgt caa agg gac agg aag tat ccg act ggg atg agg acg aac cgt gcg 4 68 

Cys Gin Arg Asp Arg Lys Tyr Pro Thr Gly Met Arg Thr Asn Arg Ala 
85 90 95 

acg gag tea gga tac tgg aaa gee ace ggg aag gat aag gag ate ttc 516 
2 5 Thr Glu Ser Gly Tyr Trp Lys Ala Thr Gly Lys Asp Lys Glu lie Phe 

100 105 110 

aaa ggc aaa ggt tgt etc gtt ggg atg aag aaa aca ctt gtg ttt tat 564 
Lys Gly Lys Gly Cys Leu Val Gly Met Lys Lys Thr Leu Val Phe Tyr 
115 120 125 

30 aga gga aga get cca aaa ggt gaa aag act aat tgg gtc atg cat gaa 612 

Arg Gly Arg Ala Pro Lys Gly Glu Lys Thr Asn Trp Val Met His Glu 
130 135 140 

tat cgt ctt gaa ggc aaa tat teg tat tac aat etc cca aaa tct gca 660 

Tyr Arg Leu Glu Gly Lys Tyr Ser Tyr Tyr Asn Leu Pro Lys Ser Ala 

145 150 155 160 

do 

agg gac gaa tgg gtc gtg tgt agg gtt ttt cac aag aac aat cct tct 708 

Arg Asp Glu Trp Val Val Cys Arg Val Phe His Lys Asn Asn Pro Ser 

165 170 175 

acc aca acc caa cca atg acg aga ata ccc gtt gaa gat ttc aca agg 756 
40 T hr Thr Thr Gin Pro Met Thr Arg lie Pro Val Glu Asp Phe Thr Arg 

180 185 190 

atg gat tct eta gag aac att gat cat etc eta gac ttc tea tct ctt 804 
Met Asp Ser Leu Glu Asn lie Asp His Leu Leu Asp Phe Ser Ser Leu 
195 200 205 

45 cct cct etc ata gac ccg agt ttc atg agt caa acc gaa caa cca aac 852 

Pro Pro Leu He Asp Pro Ser Phe Met Ser Gin Thr Glu Gin Pro Asn 
210 215 220 

ttc aaa ccc ate aac cct cca act tac gat ate tea tea cca ate caa 900 
Phe Lys Pro He Asn Pro Pro Thr Tyr Asp He Ser Ser Pro He Gin 
50 2 2 5 2 3 0 2 3 5 2 4 0 

ccc cat cat ttc aat tct tac caa tea ate ttt aac cac cag gtt ttt 948 
Pro His His Phe Asn Ser Tyr Gin Ser He Phe Asn His Gin Val Phe 
245 250 255 

ggt tct get teg ggc tct acg tac aac aac aac aac gag atg ate aag 996 
55 Gly Ser Ala Ser Gly Ser Thr Tyr Asn Asn Asn Asn Glu Met He Lys 

260 265 270 



54 



EP 1 230 345 B1 



atg gag caa tea ctt gtt agt gta tct caa gaa aca tgc eta age tea 1044 
Met Glu- Gin Ser Leu Val Ser Val Ser Gin Glu Thr Cys Leu Ser Ser 
275 280 285 

gat gtg aac gcg aac atg act aca ace acg gag gta tct teg ggt cct 1092 
Asp val Asn Ala Asn Met Thr Thr Thr Thr Glu Val Ser Ser Gly Pro 
290 295 300 

gta atg aaa caa gaa atg ggg atg atg gga atg gtg aat ggt age aag 1140 
Val Met Lys Gin Glu Met Gly Met Met Gly Met Val Asn Gly Ser Lys 
10 305 310 315 320 

teg tat gaa gat eta tgt gac ttg agg ggg gac ttg tgg gac ttc taa 1188 
Ser Tyr Glu Asp Leu Cys Asp Leu Arg Gly Asp Leu Trp Asp Phe 
325 330 335 



20 



<210> 20 
25 <211>335 



ttaatcattt gactgtggtg 


aaagagtata 


tttgttggga 


tttaaatcat gttagttaat 


1248 


acatatacat 


ataggattta 


ctagaggctt 


aatcctagtt 


aactattttc 


acttcattga 


1308 


tattatttaa 


ttagttgatt gtttaattag 


tttatacttt 


atagtgtggt 


taaaaaagaa 


1368 


aagaaaggat 


tgtgataatt 


tgggatttta gtgcataagt 


tatatctcaa 


tgtaaactgt 


1428 


atttgtatcc 


aaaaaaaaaa 


aaaaaaaaaa 


aaaaaaaaaa 


aaaaaaaaaa 


aaa 


1481 



<212> PRT 

<213> Arabidopsis thaliana 
<400> 20 
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40 



45 



50 
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Met Ala Val Val Val Glu Glu Gly Val Val Leu Asn His Gly Gly Glu 
15 10 15 

Glu Leu Val Asp Leu Pro Pro Gly Phe Arg Phe His Pro Thr Asp Glu 
20 25 30 

Glu lie He Thr Cys Tyr Leu Lye Glu Lys Val Leu Asn Ser Arg Phe 
35 40 45 

Thr Ala Val Ala Met Gly Glu Ala Asp Leu Asn Lys Cys Glu Pro Trp 
50 55 60 

Asp Leu Pro Lys Arg Ala Lys Met Gly Glu Lys Glu Phe Tyr Phe Phe 
65 70 75 80 

Cys Gin Arg Asp Arg Lys Tyr Pro Thr Gly Met Arg Thr Asn Arg Ala 
85 * 90 95 

Thr Glu Ser Gly Tyr Trp Lys Ala Thr Gly Lys Asp Lys Glu He Phe 
100 105 HO 

Lys Gly Lys Gly Cys Leu Val Gly Met Lys Lys Thr Leu Val Phe Tyr 
115 120 125 

Arg Gly Arg Ala Pro Lys Gly Glu Lys Thr Asn Trp Val Met His Glu 
130 135 140 

Tyr Arg Leu Glu Gly Lys Tyr Ser Tyr Tyr Asn Leu Pro Lys Ser Ala 
145 150 155 160 

Arg Asp Glu Trp Val Val Cys Arg Val Phe His Lys Asn Asn Pro Ser 
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165 170 175 



Thr Thr Thr Gin Pro Met Thr Arg lie Pro Val Glu Asp Phe Thr Arg 
180 185 190 



Met Asp Ser Leu Glu Asn lie Asp His Leu Leu Asp Phe Ser Ser Leu 
195 200 205 



Pro Pro Leu lie Asp Pro Ser Phe Met Ser Gin Thr Glu Gin Pro Asn 
210 215 220 



Phe Lys Pro He Asn Pro Pro Thr Tyr Asp He Ser Ser Pro He Gin 
225 230 " 235 240 



Pro His His Phe Asn Ser Tyr Gin Ser He Phe Asn His Gin Val Phe 
245 250 255 



Gly Ser Ala Ser Gly Ser Thr Tyr Asn Asn Asn Asn Glu Met He Lys 
260 265 270 



Met Glu Gin Ser Leu Val Ser Val Ser Gin Glu Thr Cys Leu Ser Ser 
275 280 285 



Asp Val Asn Ala Asn Met Thr Thr Thr Thr Glu Val Ser Ser Gly Pro 
290 295 300 



Val Met Lys Gin Glu Met Gly Met Met Gly Met Val Asn Gly Ser Lys 
305 310 315 320 



Ser Tyr Glu Asp Leu Cys Asp Leu Arg Gly Asp Leu Trp Asp Phe 
325 330 335 



<210>21 
<211>890 
<212> DNA 

<213> Arabidopsis thaliana 

<220> 
<221>CDS 
<222> (55).. (738) 
<223> G545 

<400>21 



57 
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gcaaccttca aactaaaact cgagagacaa gaaatcctca gaatctttaa ctta atg 57 

Met 
1 

gcg etc gag get ctt aca tea cca aga tta get tct ccg att cct cct 105 
Ala Leu Glu Ala Leu Thr Ser Pro Arg Leu Ala Ser Pro lie Pro Pro 
5 10 15 

ttg ttc gaa gat tct tea gtc ttc cat gga gtc gag cac tgg aca aag 153 
Leu Phe Glu Asp Ser Ser Val Phe His Gly Val Glu His Trp Thr Lys 
20 25 30 

ggt aag cga tct aag aga tea aga tec gat ttc cac cac caa aac etc 201 
Gly Lys Arg Ser Lys Arg Ser Arg Ser Asp Phe His His Gin Asn Leu 
35 40 45 

act gag gaa gag tat eta get ttt tgc etc atg ctt etc get cgc gac 249 
Thr Glu Glu Glu Tyr Leu Ala Phe Cys Leu Met Leu Leu Ala Arg Asp 
50 55 " 60 65 



aac cgt cag cct cct cct cct ccg gcg gtg gag aag ttg age tac aag 297 

20 Asn Arg Gin Pro Pro Pro Pro Pro Ala Val Glu Lys Leu Ser Tyr Lys 

70 75 80 

tgt age gtc tgc gac aag acg ttc tct tct tac caa get etc ggt ggt 345 

Cys Ser Val Cys Asp Lys Thr Phe Ser Ser Tyr Gin Ala Leu Gly Gly 

85 ~ 90 95 



25 



35 



40 



50 



55 



cac aag gca age cac cgt aag aac tta tea cag act etc tec ggc gga 
His LyB Ala Ser His Arg Lys Asn Leu Ser Gin Thr Leu Ser Gly Gly 
100 105 110 



cct tec ggt caa get etc ggc gga cac aag egg tgc cac tac gaa gga 
Pro Ser Gly Gin Ala Leu Gly Gly His Lys Arg Cys His Tyr Glu Gly 
150 155 160 



cct ccg ate cct gaa ttc teg atg gtc aac gga gac gac gaa gtc atg 
Pro Pro lie Pro Glu Phe Ser Met Val Asn Gly Asp Asp Glu Val Met 
45 1 9 5 2 0 0 2 0 5 

age cct atg ccg gcg aag aag cct egg ttt gac ttt ccg gtc aaa ctt 
Ser Pro Met Pro Ala Lys Lys Pro Arg Phe Asp Phe Pro val Lys Leu 
210 215 220 225 



<210> 22 
<211>227 



393 



gga gat gat cat tea acc teg teg gcg aca acc aca tec gec gtg act 441 

Gly Asp Asp His Ser Thr Ser Ser Ala Thr Thr Thr Ser Ala Val Thr 

115 120 125 

act gga agt ggg aaa tea cac gtt tgc acc ate tgt aac aag tct ttt 489 

Thr Gly Ser Gly Lys Ser His Val Cys Thr lie Cys Asn Lys Ser Phe 

130 135 140 145 



537 



aac aac aac ate aac act agt age gtg tec aac tec gaa ggt gcg ggg 585 

Asn Asn Asn He Asn Thr Ser Ser Val Ser Asn Ser Glu Gly Ala Gly 
165 170 175 

tec act age cac gtt age agt age cac cgt ggg ttt gac etc aac ate 633 

Ser Thr Ser His Val Ser Ser Ser His Arg Gly Phe Asp Leu Asn He 

180 185 190 



681 



729 



caa ctt taa ggaaatttac ttagacgata agatttegtt tgtatactgt 778 
Gin Leu 

tgagagttgt gtaggaattt gttgactgta cataccaaat tggactttga ctgattccaa 838 

ttcttcttgt tctttcattt taaaaattat taaaccgatt ctttaccaca aa 890 
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<212> PRT 

<213> Arabidopsis thaliana 



<400> 22 



Met Ala Leu Glu Ala Leu Thr Ser Pro Arg Leu Ala Ser Pro lie Pro 
15 10 15 

Pro Leu Phe Glu Asp Ser Ser Val Phe His Gly Val Glu His Trp Thr 
20 25 30 

Lys Gly Lys Arg Ser Lys Arg Ser Arg Ser Asp Phe His His Gin Asn 
35 ^ ' 40 45 

Leu Thr Glu Glu Glu Tyr Leu Ala Phe Cys Leu Met Leu Leu Ala Arg 
50 55 60 

Asp Asn Arg Gin Pro Pro Pro Pro Pro Ala Val Glu Lys Leu Ser Tyr 
65 70 75 80 



Lys Cys Ser Val Cys Asp Lys Thr Phe Ser Ser Tyr Gin Ala Leu Gly 
85 90 95 

Gly His Lys Ala Ser His Arg Lys- Asn Leu Ser Gin Thr Leu Ser Gly 
100 105 110 

Gly Gly Asp Asp His Ser Thr Ser Ser Ala Thr Thr Thr Ser Ala Val 
115 120 125 

Thr Thr Gly Ser Gly Lys Ser His Val Cys Thr lie Cys Asn Lys Ser 
130 135 140 

Phe Pro Ser Gly Gin Ala Leu Gly Gly His Lys Arg Cys His Tyr Glu 
145 150 155 160 

Gly Asn Asn Asn He Asn Thr Ser Ser Val Ser Asn Ser Glu Gly Ala 
165 170 175 

Gly Ser Thr Ser His Val Ser Ser Ser His Arg Gly Phe Asp Leu Asn 
180 185 190 

He Pro Pro lie Pro Glu Phe Ser Met Val Asn Gly Asp Asp Glu Val 
195 200 205 

Met Ser Pro Met Pro Ala Lys Lys Pro Arg Phe Asp Phe Pro Val Lys 
210 215 220 



Leu Gin Leu 
225 



<210> 23 
<211> 1413 
<212> DNA 

<213> Arabidopsis thaliana 
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<220> 
<221>CDS 
<222> (86)..(1168) 
<223> G561 

5 

<400> 23 

aatttgtttt tttttctttt gtgggttcaa ttcgaattgt tttccctgag actcaagtta 60 

w ctgtgtcatt actctgcatt gagca atg ggt age aac gaa gaa gga aac ccc 112 

Met Gly Ser Asn Glu Glu Gly Asn Pro 
1 5 

act aac aac tct gat aag cca teg caa get get get cct gag cag agt 160 

Thr Asn Asn Ser Asp Lys Pro Ser Gin Ala Ala Ala Pro Glu Gin Ser 
10 15 20 25 

15 

aat gtt cat gtg tat cat cat gac tgg get get atg cag gca tat tat 208 

Asn Val His Val Tyr His His Asp Trp Ala Ala Met Gin Ala Tyr Tyr 

30 35 40 



20 



ggg cct aga gtt ggt ata cct caa tat tac aac tea aat ttg gcg cct 256 

Gly Pro Arg Val Gly He Pro Gin Tyr Tyr Asn Ser Asn Leu Ala Pro 
45 ' 50 55 

ggt cat get cca ccg cct tat atg tgg gcg tct cca teg cca atg atg 304 

Gly His Ala Pro Pro Pro Tyr Met Trp Ala Ser Pro Ser Pro Met Met 
60 65 70 



25 



30 



35 



40 



45 



50 



55 



60 



10 



20 



25 



55 
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get cct tat gga gca cca tat cca cca ttt tgc cct cct ggt gga gtt 352 
Ala Pro Tyr Gly Ala Pro Tyr Pro Pro Phe Cys Pro Pro Gly Gly Val 
75 80 85 



tat get cat cct ggt gtt caa atg ggc tea caa cca caa ggt cct gtt 
Tyr Ala His Pro Gly Val Gin Met Gly Ser Gin Pro Gin Gly Pro Val 
90 95 100 105 



ccc tea tct caa aac agt ctg cct ctt aga ggt gaa aat gag aaa ccc 
Pro Ser Ser Gin Asn Ser Leu Pro Leu Arg Gly Glu Asn Glu Lys Pro 
205 210 215 



400 



tct caa tea gca tct gga gtt aca acc cct ttg acc att gat gca cca 448 

Ser Gin Ser Ala Ser Gly Val Thr Thr Pro Leu Thr lie Asp Ala Pro 

110 115 120 

get aat tea get gga aac tea gat cat ggg ttc atg aaa aag ctg aaa 4 96 

Ala Asn Ser Ala Gly Asn Ser Asp His Gly Phe Met Lys Lys Leu Lys 

125 130 135 

gag ttc gat gga ctt gca^atg tea ata age aat aac aaa gtt ggg agt 544 

Glu Phe Asp Gly Leu Ala Met Ser He Ser Asn Asn Lys Val Gly Ser 

140 145 150 

get gaa cat age age agt gaa cat agg agt tct cag age tec gag aat 592 

Ala Glu His Ser Ser Ser Glu His Arg Ser Ser Gin Ser Ser Glu Asn 

155 160 165 

gat ggc tct age aat ggt agt gat ggt aat aca act ggg gga gaa caa 64 0 

Asp Gly Ser Ser Asn Gly Ser Asp Gly Asn Thr Thr Gly Gly Glu Gin 

170 175 180 185 

tct agg agg aaa aga agg caa caa aga tea cca age act ggt gaa aga 68 8 

Ser Arg Arg Lys Arg Arg Gin Gin Arg Ser Pro Ser Thr Gly Glu Arg 

190 195 200 



736 



gat gtg act atg ggg act cct gtt atg ccc aca gca atg agt ttc caa 784 
Asp Val Thr Met Gly Thr Pro Val Met Pro Thr Ala Met Ser Phe Gin 
30 220 225 230 

aac tct get ggc atg aac ggt gtg cca cag cca tgg aat gaa aaa gag 83 2 

Asn Ser Ala Gly Met Asn Gly Val Pro Gin Pro Trp Asn Glu Lys Glu 
235 240 245 

gtt aaa cga gag aag aga aaa cag tea aac cga gaa tct get agg agg 88 0 

35 val Lys Arg Glu Lys Arg Lys Gin Ser Asn Arg Glu Ser Ala Arg Arg 

250 255 260 265 

tea aga ctg agg aag cag get gaa aca gaa caa eta tct gtc aaa gtt 928 
Ser Arg Leu Arg Lys Gin Ala Glu Thr Glu Gin Leu Ser Val Lys Val 
270 275 280 

40 gac gca tta gta get gag aac atg tct ctg agg tct aaa eta ggc cag 976 

Asp Ala Leu Val Ala Glu Asn Met Ser Leu Arg Ser Lys Leu Gly Gin 
285 290 295 

eta aac aat gag tct gag aaa eta egg ctg gag aac gaa get ata ttg 1024 
Leu Asn Asn Glu Ser Glu Lys Leu Arg Leu Glu Asn Glu Ala He Leu 
300 305 310 

45 

gat caa ctg aaa gcg caa gca aca ggg aaa aca gag aac ctg ate tct 1072 
Asp Gin Leu Lys Ala Gin Ala Thr Gly Lys Thr Glu Asn Leu He Ser 
315 320 325 

cga gtt gat aag aac aac tct gta tea ggt age aaa act gtg cag cat 1120 
Arg Val Asp Lys Asn Asn Ser Val Ser Gly Ser Lys Thr Val Gin His 
50 330 335 340 345 

caa ctg tta aat gca agt ccg ata acc gat cct gtc gcg get age tga 1168 
Gin Leu Leu Asn Ala Ser Pro He Thr Asp Pro Val Ala Ala Ser 
350 355 360 



ccgtggccgc aacaatgaga acccgatatt tcttcctttg ggttgtgatt gtaacttaaa 1228 
aggagacttt ttgtttttat tcttagattt gtagctctct gcatagtgag cataaattga 1288 



61 
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tgtaatatgg tttaagagat tcggtgttct ctggtgtgtg ctgcaaccac ataattggtg 1348 

atagataggt ttagttatat aagcaaatgt attagagata aggggagaca tatttgatgg 1408 

5 tcttt 1413 

<210> 24 
<211>360 
10 <212>PRT 

<213> Arabidopsis thaliana 

<400> 24 

15 
20 
25 
30 
35 
40 
45 
50 
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Met Gly Ser Asn Glu Glu Gly Asn Pro Thr Asn Asn Ser Asp Lys Pro 
15 10 15 

Ser Gin Ala Ala Ala Pro Glu Gin Ser Asn Val His Val Tyr His His 
20 25 30 

Asp Trp Ala Ala Met Gin Ala Tyr Tyr Gly Pro Arg Val Gly lie Pro 
35 40 45 

Gin Tyr Tyr Asn Ser Asn Leu Ala Pro Gly His Ala Pro Pro Pro Tyr 
50 55 60 

Met Trp Ala Ser Pro Ser Pro Met Met Ala Pro Tyr Gly Ala Pro Tyr 
65 70 75 80 

Pro Pro Phe Cys Pro Pro Gly Gly Val Tyr Ala His Pro Gly Val Gin 
85 90 95 

Met Gly Ser Gin Pro Gin Gly Pro Val Ser Gin Ser Ala Ser Gly Val 
100 105 110 

Thr Thr Pro Leu Thr lie Asp Ala Pro Ala Asn Ser Ala Gly Asn Ser 
115 120 125 

Asp His Gly Phe Met Lys Lys Leu Lys Glu Phe Asp Gly Leu Ala Met 
130 135 140 

Ser lie Ser Asn Asn Lys Val Gly Ser Ala Glu His Ser Ser Ser Glu 
145 150 155 160 

His Arg Ser Ser Gin Ser Ser Glu Asn Asp Gly Ser Ser Asn Gly Ser 
165 170 175 

Asp Gly Asn Thr Thr Gly Gly Glu Gin Ser Arg Arg Lys Arg Arg Gin 
180 185 190 

Gin Arg Ser Pro Ser Thr Gly Glu Arg Pro Ser Ser Gin Asn Ser Leu 
195 200 205 

Pro Leu Arg Gly Glu Asn Glu Lys Pro Asp Val Thr Met Gly Thr Pro 
210 215 220 

Val Met Pro Thr Ala Met Ser Phe Gin Asn Ser Ala Gly Met Asn Gly 
225 230 235 240 
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Val Pro Gin Pro Trp Asn Glu Lys Glu Val Lys Arg Glu Lye Arg Lys 
245 250 255 



Gin Ser Asn Arg Glu Ser Ala Arg Arg Ser Arg Leu Arg Lys Gin Ala 
260 265 270 



Glu Thr Glu Gin Leu Ser Val Lys Val Asp Ala Leu Val Ala Glu Asn 
275 280 285 



Met Ser Leu Arg Ser Lys Leu Gly Gin Leu Asn Asn Glu Ser Glu Lys 
290 ^ 295 300 



Leu Arg Leu Glu Asn Glu Ala lie Leu Asp Gin Leu Lys Ala Gin Ala 
305 310 315 320 



Thr Gly Lys Thr Glu Asn Leu lie Ser Arg Val Asp Lys Asn Asn Ser 
325 330 335 



Val Ser Gly Ser Lys Thr Val Gin His Gin Leu Leu Asn Ala Ser Pro 
340 345 350 



lie Thr Asp Pro Val Ala Ala Ser 
355 360 



<210> 25 
<211> 1087 
<212> DNA 

<213> Arabidopsis thaliana 

<220> 
<221> CDS 
<222> (104).. (952) 
<223> G664 

<400> 25 
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caatccctca atataaaata acaagtagaa ttgatctgcc tatatataag attttgagac 60 

gaaataagat ctaaaccaca agaaagaaag taaacataaa agt atg gga agg tea 115 

Met Gly Arg Ser 

1 

ccg tgc tgt gag aaa get cac aca aac aaa gga gca tgg acg aaa gaa 163 
Pro Cys Cys Glu Lys Ala His Thr Asn Lys Gly Ala Trp Thr Lye Glu 
5 10 15 " 20 

gag gac gag agg etc gtc gee tac att aaa get cat gga gaa ggc tgc 211 
w Glu Asp Glu Arg Leu Val Ala Tyr He Lys Ala His Gly Glu Gly Cys 

25 30 35 

tgg aga tct etc ccc aaa gee gee gga ctt ctt cgc tgt ggc aag age 259 
Trp Arg Ser Leu Pro Lys Ala Ala Gly Leu Leu Arg Cys Gly Lys Ser 
40 45 50 

15 tgc cgt etc egg tgg ate aac tat etc egg cct gac ctt aag cgt gga 307 

Cys Arg Leu Arg Trp He Asn Tyr Leu Arg Pro Asp Leu Lys Arg Gly 
55 60 65 

aac ttc acc gag gaa gaa gac gaa etc ate ate aag etc cat age ctt 355 
Asn Phe Thr Glu Glu Glu Asp Glu Leu He He Lys Leu His Ser Leu 
20 70 75 80 

ctt ggc aac aaa tgg teg ctt att gee ggg aga tta ccg gga aga aca 403 
Leu Gly Asn Lys Trp Ser Leu He Ala Gly Arg Leu Pro Gly Arg Thr 
85 * 90 95 100 



25 



gat aac gag ata aag aac tat tgg aac acg cat ata cga aga aag ctt 451 



30 



35 



40 



45 



50 



55 
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10 



Asp Asn Glu lie Lys Asn Tyr Trp Asn Thr His lie Arg Arg Lys Leu 
105 110 115 

ata aac aga ggg att gat cca acg agt cat aga cca ate caa gaa tea 4 99 

lie Asn Arg Gly lie Asp Pro Thr Ser His Arg Pro lie Gin Glu Ser 
120 125 130 

tea get tct caa gat tct aaa cct aca caa eta gaa cca gtt acg agt 547 
Ser Ala Ser Gin Asp Ser Lys Pro Thr Gin Leu Glu Pro Val Thr Ser 
135 140 145 

aat ace att aat ate tea ttc act tct get cca aag gtc gaa acg ttc 595 
Asn Thr lie Asn lie Ser Phe Thr Ser Ala Pro Lys Val Glu Thr Phe 
150 155 160 

cat gaa agt ata age ttt ccg gga aaa tea gag aaa ate tea atg ctt 643 
15 His Glu Ser lie Ser Phe Pro Gly Lys Ser Glu Lys lie Ser Met Leu 

165 170 175 180 

acg ttc aaa gaa gaa aaa gat gag tgc cca gtt caa gaa aag ttc cca 691 
Thr Phe Lys Glu Glu Lys Asp Glu Cys Pro Val Gin Glu Lys Phe Pro 
185 190 195 

20 gat ttg aat ctt gag etc aga ate agt ctt cct gat gat gtt gat cgt 739 

Asp Leu Asn Leu Glu Leu Arg lie Ser Leu Pro Asp Asp Val Asp Arg 
200 205 210 

ctt caa ggg cat gga aag tea aca acg cca cgt tgt ttc aag tgc age 787 
Leu Gin Gly His Gly Lys Ser Thr Thr Pro Arg Cys Phe Lys Cys Ser 
25 2 1 5 * 220 225 

tta ggg atg ata aac ggc atg gag tgc aga tgc gga aga atg aga tgc 835 
Leu Gly Met lie Asn Gly Met Glu Cys Arg Cys Gly Arg Met Arg Cys 
230 235 240 

gat gta gtc gga ggt age age aag ggg agt gac atg age aat gga ttt 883 
30 Asp Val Val Gly Gly Ser Ser Lys Gly Ser Asp Met Ser Asn Gly Phe 

245 250 255 260 

gat ttt tta ggg ttg gca aag aaa gag ace act tct ctt ttg ggc ttt 931 
Asp Phe Leu Gly Leu Ala Lys Lys Glu Thr Thr Ser Leu Leu Gly Phe 
265 270 275 

cga age ttg gag atg aaa taa tattgtcaaa ttttaggcgt aactgtacaa 982 
Arg Ser Leu Glu Met Lys 
280 

aacttttgee tagataattt gaaagtatat cttcaacttg tatgagaaat ttaactggtg 1042 
40 aattataata tatagaattt gttttttaaa aaaaaaaaaa aaaaa 1087 



<210> 26 
<211>282 
45 <212>PRT 

<213> Arabidopsis thaliana 

<400> 26 

50 



35 



55 
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Met Gly Arg Ser Pro Cys Cys Glu Lys Ala His Thr Asn Lys Gly Ala 
15 10 15 

Trp Thr Lys Glu Glu Asp Glu Arg Leu Val Ala Tyr lie Lys Ala His 
20 25 30 

Gly Glu Gly Cys Trp Arg Ser Leu Pro Lys Ala Ala Gly Leu Leu Arg 
35 * 40 45 

Cys Gly Lys Ser Cys Arg Leu Arg Trp lie Asn Tyr Leu Arg Pro Asp 
50 55 60 



Leu Lys Arg Gly Asn Phe Thr Glu Glu Glu Asp Glu Leu lie lie Lys 
65 70 75 80 

Leu His Ser Leu Leu Gly Asn Lys Trp Ser Leu lie Ala Gly Arg Leu 
85 90 95 

Pro Gly Arg Thr Asp Asn Glu He Lys Asn Tyr Trp Asn Thr His He 
100 ' 105 110 

Arg Arg Lys Leu He Asn Arg Gly lie Asp Pro Thr Ser His Arg Pro 
115 120 125 

He Gin Glu Ser Ser Ala Ser Gin Asp Ser Lys Pro Thr Gin Leu Glu 
130 135 140 

Pro Val Thr Ser Asn Thr He Asn He Ser Phe Thr Ser Ala Pro Lys 
145 150 155 160 

Val Glu Thr Phe His Glu Ser He Ser Phe Pro Gly Lys Ser Glu Lys 
165 170 175 

He Ser Met Leu Thr Phe Lys Glu Glu Lys Asp Glu Cys Pro Val Gin 
180 185 190 

Glu Lys Phe Pro Asp Leu Asn Leu Glu Leu Arg He Ser Leu Pro Asp 
.195 200 205 

Asp Val Asp Arg Leu Gin Gly His Gly Lys Ser Thr Thr Pro Arg Cys 
210 215 220 

Phe Lys Cys Ser Leu Gly Met lie Asn Gly Met Glu Cys Arg Cys Gly 
225 230 235 240 

Arg Met Arg Cys Asp Val Val Gly Gly Ser Ser Lys Gly Ser Asp Met 
245 250 255 

Ser Asn Gly Phe Asp Phe Leu Gly Leu Ala Lys Lys Glu Thr Thr Ser 
260 265 270 

Leu Leu Gly Phe Arg Ser Leu Glu Met Lys 
275 280 
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10 



20 



25 



35 



40 



45 



50 



55 



<213> Arabidopsis thaliana 

<220> 
<221>CDS 
<222> (1)..(480) 
<223> G91 1 

<400> 29 

atg ggt ctt cct gaa gat ttc ate acc gag ctt cag att cca ggt tac 48 

Met Gly Leu Pro Glu Asp Phe lie Thr Glu Leu Gin lie Pro Gly Tyr 
15 10 15 

ata tta aag ata ctt tac gtc ate ggt ttc ttt aga gac atg gtc gat 96 
lie Leu Lys He Leu Tyr Val He Gly Phe Phe Arg Asp Met Val Asp 
20 25 30 

get ctt tgt cct tac att ggt eta cct agt ttt eta gac cac aac gag 144 
Ala Leu Cys Pro Tyr He Gly Leu Pro Ser Phe Leu Asp His Asn Glu 
35 40 45 

acc tct gga ccc gat ccg acc cga cac get etc tct acg tea gcg agt 192 
Thr Ser Gly Pro Asp Pro Thr Arg His Ala Leu Ser Thr Ser Ala Ser 
50 55 60 

ctt get aac gag ttg ate ccg gtg gtt egg ttc teg gat ctt ccg acc 240 
Leu Ala Asn Glu Leu He Pro Val Val Arg Phe Ser Asp Leu Pro Thr 
65 70 75 80 

gat ccg gaa gat tgt tgt acg gtt tgt ttg tea gat ttt gag tec gac 288 
Asp Pro Glu Asp Cys Cys Thr Val Cys Leu Ser Asp Phe Glu Ser Asp 



85 90 95 

gat aag gtt agg cag eta ccc aag tgt gga cac gtg ttt cat cat cat 33 6 

Asp Lys Val Arg Gin Leu Pro Lys Cys Gly His Val Phe His His His 
100 105 110 

tgt tta gac cgt tgg ate gtt gac tac aac aag atg aaa tgt ccg gtt 384 
Cys Leu Asp Arg Trp He Val Asp Tyr Asn Lys Met Lys Cys Pro Val 
115 120 125 

tgt egg cac egg ttc tta ccg aaa gaa aag tac acg caa tgt gat tgg 432 
Cys Arg His Arg Phe Leu Pro Lys Glu Lys Tyr Thr Gin Cys Asp Trp 
130 135 140 

ggt tct ggt tea gat tgg ttt agt gat gaa gtg gaa agt acc aac taa 4 80 

Gly Ser Gly Ser Asp Trp Phe Ser Asp Glu Val Glu Ser Thr Asn 
145 150 155 



<210> 30 
<211> 159 
<212>prt 

<213> Arabidopsis thaliana 
<400> 30 
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Met Gly Leu Pro Glu Asp Phe lie Thr Glu Leu Gin lie Pro Gly Tyr 
1 5 10 15 

lie Leu Lys lie Leu Ty r v *l He Gly Phe Phe Arg Asp Met Val Asp 
20 25 30 

Ala Leu Cys Pro Tyr lie Gly Leu Pro Ser Phe Leu Asp His Asn Glu 
35 40 45 

Thr Ser Gly Pro Asp Pro Thr Arg His Ala Leu Ser Thr Ser Ala Ser 
50 55 60 

Leu Ala Asn Glu Leu lie Pro Val Val Arg Phe Ser Asp Leu Pro Thr 
65 70 75 80 

Asp Pro Glu Asp Cys Cys Thr Val Cys Leu Ser Asp Phe Glu Ser Asp 
85 90 95 

Asp Lys Val Arg Gin Leu Pro Lys Cys Gly His Val Phe His His His 
100 105 110 

Cys Leu Asp Arg Trp He Val Asp Tyr Asn Lys Met Lys Cys Pro Val 
115 120 125 

Cys Arg His Arg Phe Leu Pro Lys Glu Lys Tyr Thr Gin Cys Asp Trp 
130 135 140 

Gly Ser Gly Ser Asp Trp Phe Ser Asp Glu Val Glu Ser Thr Asn 
145 150 155 



<210>31 
<211> 1221 
<212> DNA 

<213> Arabidopsis thaliana 

<220> 
<221> CDS 
<222> (162)..(1013) 
<223> G964 

<400> 31 
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10 



20 



25 



atttctcttc cacaaagagt cctaacttcg agttgaaaca aacaccattt ctcatctcta 60 

tctcagaaag aacaaaccat ttcgtgttct ttctttctct attctcataa ggaaatataa 120 

ttcctgaaac tgttgagttc ttgtgaaagg aaataaaaaa c atg atg atg ggc aaa 176 

Met Met Met Gly Lys 
1 5 

gaa gat eta ggt ttg age eta age tta ggg ttt tea caa aat cac aat 224 
Glu Asp Leu Gly Leu Ser Leu Ser Leu Gly Phe Ser Gin Asn His Asn 
10 15 20 

cct ctt cag atg aat ctg aat cct aac tct tea tta tea aac aat etc 272 
Pro Leu Gin Met Asn Leu Asn Pro Asn Ser Ser Leu Ser Asn Asn Leu 
25 30 35 

cag aga etc cca tgg aac caa aca ttc gat cct aca tea gat ctt cgc 320 
Gin Arg Leu Pro Tip Asn Gin Thr Phe Asp Pro Thr Ser Asp Leu Arg 
40 45 50 

aag ata gac gtg aac agt ttt cca tea acg gtt aac tgc gag gaa gac 368 
Lys lie Asp Val Asn Ser Phe Pro Ser Thr Val Asn Cys Glu Glu Asp 
55 60 65 

aca gga gtt teg tea cca aac agt acg ate tea age ace att age ggg 416 
Thr Gly Val Ser Ser Pro Asn Ser Thr lie Ser Ser Thr lie Ser Gly 
70 75 80 85 

aag aga agt gag aga gaa gga ate tec gga acc ggc gtt ggc tec ggc 4 64 

Lys Arg Ser Glu Arg Glu Gly lie Ser Gly Thr Gly Val Gly Ser Gly 
90 95 100 

gac gat cac gac gag ate act ccg gat cga ggg tac tea cgt gga acc 512 
Asp Asp His Asp Glu lie Thr Pro Asp Arg Gly Tyr Ser Arg Gly Thr 
105 110 115 

tea gat gaa gaa gaa gac ggg ggc gaa acg teg agg aag aag etc agg 5 60 

Ser Asp Glu Glu Glu Asp Gly Gly Glu Thr Ser Arg Lys Lys Leu Arg 
30 120 125 130 

tta tea aaa gat cag tct get ttt etc gaa gag act ttc aaa gaa cac 6 08 

Leu Ser Lys Asp Gin Ser Ala Phe Leu Glu Glu Thr Phe Lys Glu His 
135 140 145 

35 aac act etc aat ccc aaa cag aag eta get ttg get aag aag ctg aac 656 

Asn Thr Leu Asn Pro Lys Gin Lys Leu Ala Leu Ala Lys Lys Leu Asn 
150 155 160 165 

ttg acg gca aga caa gtg gaa gtg tgg ttc caa aac aga aga get aga 704 

Leu Thr Ala Arg Gin Val Glu Val Trp Phe Gin Asn Arg Arg Ala Arg 

170 175 180 

40 

acc aag tta aag caa acg gag gta gat tgc gaa tac ttg aaa egg tgc 752 

Thr Lys Leu Lys Gin Thr Glu Val Asp Cys Glu Tyr Leu Lys Arg Cys 

185 190 " 195 

gta gag aag eta acg gaa gag aac egg aga ctt cag aaa gag get atg 8 00 

45 Val Glu Lys Leu Thr Glu Glu Asn Arg Arg Leu Gin Lys Glu Ala Met 

200 205 210 

gag ctt cga act etc aag ctg tct cca caa ttc tac ggt cag atg act 848 
Glu Leu Arg Thr Leu Lys Leu Ser Pro Gin Phe Tyr Gly Gin Met Thr 
215 220 225 

50 cca cca act aca etc ate atg tgt cct teg tgc gag cgt gta get ggt 896 

Pro Pro Thr Thr Leu lie Met Cys Pro Ser Cys Glu Arg Val Ala Gly 
230 235 240 245 

cca tea tea teg aac cat cac cac aat cac agg ccg gtt teg att aac 944 
Pro Ser Ser Ser Asn His His His Asn His Arg Pro Val Ser lie Asn 
55 250 255 260 
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ccg tgg att get tgt get ggt cag gtg get cat ggg ctg aat ttt gaa 992 
Pro Trp lie Ala Cys Ala Gly Gin Val Ala His Gly Leu Asn Phe Glu 
265 270 275 

gee ttg cgt cca cga teg taa tttttagtgg tgggggaagg gtgttttggg 1043 
Ala Leu Arg Pro Arg Ser 
280 

ttttttcatt ategttatat agtctatctg tgtggggtca ttgtaatttt ggatgattgg 1103 

ccttctcatg aactagtcat atgtatgatg caaccttaaa aatatttcaa gtagcaaaac 1163 

ttaattacaa acttgetata ttaaccaaaa attatgaaaa aaaaaaaaaa aaaaaaaa 1221 



<210> 32 
<211>283 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 32 
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Met Met Met Gly Lys Glu Asp Leu Gly Leu Ser Leu Ser Leu Gly Phe 
15 10 15 

Ser Gin Asn His Asn Pro Leu Gin Met Asn Leu Asn Pro Asn Ser Ser 
20 25 30 

Leu Ser Asn Asn Leu Gin Arg Leu Pro Trp Asn Gin Thr Phe Asp Pro 
35 40 45 

Thr Ser Asp Leu Arg Lys He Asp Val Asn Ser Phe Pro Ser Thr Val 
50 55 60 

Asn Cys Glu Glu Asp Thr Gly Val Ser Ser Pro Asn Ser Thr He Ser 
65 70 75 80 

Ser Thr He Ser Gly Lys Arg Ser Glu Arg Glu Gly lie Ser Gly Thr 
85 90 95 

Gly Val Gly Ser Gly Asp Asp His Asp Glu He Thr Pro Asp Arg Gly 
100 105 110 

Tyr Ser Arg Gly Thr Ser Asp Glu Glu Glu Asp Gly Gly Glu Thr Ser 
115 120 125 

Arg Lys Lys Leu Arg Leu Ser Lys Asp Gin Ser Ala Phe Leu Glu Glu 
130 135 140 

Thr Phe Lys Glu His Asn Thr Leu Asn Pro Lys Gin Lys Leu Ala Leu 
145 150 155 160 

Ala Lys Lys Leu Asn Leu Thr Ala Arg Gin Val Glu Val Trp Phe Gin 
165 170 175 

Asn Arg Arg Ala Arg Thr Lys Leu Lys Gin Thr Glu Val Asp Cys Glu 
180 185 190 

Tyr Leu Lys Arg Cys Val Glu Lys Leu Thr Glu Glu Asn Arg Arg Leu 
195 200 205 

Gin Lys Glu Ala Met Glu Leu Arg Thr Leu Lys Leu Ser Pro Gin Phe 



210 215 220 

Tyr Gly Gin Met Thr Pro Pro Thr Thr Leu He Met Cys Pro Ser Cys 
225 230 235 240 

Glu Arg Val Ala Gly Pro Ser Ser Ser Asn His His His Asn His Arg 
245 250 255 

Pro Val Ser He Asn Pro Trp He Ala Cys Ala Gly Gin val Ala His 
260 265 270 



Gly Leu Asn Phe Glu Ala Leu Arg Pro Arg Ser 
275 280 
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<210> 33 
<211> 1249 
<212> DNA 

<213> Arabidopsis thaliana 

5 

<220> 
<221> CDS 
<222> (82). .(91 8) 
<223> G394 

10 

<400> 33 

gaaattctta acaaacaatt ttcttcataa tattaattct caagatctta aagattatat 60 

15 taatacgaag agaaaattca a atg ggt ctt gat gat tea tgc aac aca ggt 111 

Met Gly Leu Asp Asp Ser Cys Asn Thr Gly 
15 10 

ctt gtt ctt ggt tta ggc etc tea cca acg cct aat aat tac aat cat 159 
Leu Val Leu Gly Leu Gly Leu Ser Pro Thr Pro Asn Asn Tyr Asn His 
15 ~ 20 25 

gec ate aag aaa tct tec tec act gtg gac cat cgt ttc ate agg etc 207 
Ala lie Lys Lys Ser Ser Ser Thr Val Asp His Arg Phe lie Arg Leu 
30 35 *" 40 

gat ccg teg ttg act eta age eta tec ggt gag age tac aag ate aag 255 
25 Asp Pro Ser Leu Thr Leu Ser Leu Ser Gly Glu Ser Tyr Lys lie Lys 

4 5 50 55 

act ggt gee ggc gec ggc gac caa att tgc egg cag acc teg tec cac 303 
Thr Gly Ala Gly Ala Gly Asp Gin lie Cys Arg Gin Thr Ser Ser His 
60 65 70 

30 age ggc ate tea tct ttc teg age gga agg gta aag aga gaa aga gaa 351 

Ser Gly lie Ser Ser Phe Ser Ser Gly Arg Val Lys Arg Glu Arg Glu 
75 * 80 85 90 

ate tec ggc ggc gat gga gaa gaa gag gcg gag gag acg acg gag aga 399 
He Ser Gly Gly Asp Gly Glu Glu Glu Ala Glu Glu Thr Thr Glu Arg 
35 95 100 105 

gtg gtg tgt teg aga gtg agt gat gat cat gac gat gaa gaa ggt gtt 44 7 

Val Val Cys Ser Arg Val Ser Asp Asp His Asp Asp Glu Glu Gly Val 
110 115 120 



20 



40 



45 



50 



agt get cgt aaa aag ctt aga etc act aaa caa caa tct get ctt etc 495 

Ser Ala Arg Lys Lys Leu Arg Leu Thr Lys Gin Gin Ser Ala Leu Leu 
125 130 135 

gaa gat aac ttc aaa ctt cat age acc ctt aat ccc aag caa aaa caa 54 3 

Glu Asp Asn Phe Lys Leu His Ser Thr Leu Asn Pro Lys Gin Lys Gin 
140 " 145 150 

get ctt gcg aga cag ctg aat eta agg cct aga caa gtt gaa gtg tgg 591 

Ala Leu Ala Arg Gin Leu Asn Leu Arg Pro Arg Gin Val Glu Val Trp 



55 



74 
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155 160 165 170 

ttc caa aac agg aga get aga aca aaa eta aag caa aca gaa gtg gat 639 

Phe Gin Asn Arg Arg Ala Arg Thr Lys Leu Lys Gin Thr Glu Val Asp 
175 180 185 

o 

tgt gag ttt ttg aag aaa tgt tgc gag act tta acg gat gag aat aga 687 

Cys Glu Phe Leu Lys Lys Cys Cys Glu Thr Leu Thr Asp Glu Asn Arg 
190 195 200 

agg ctt caa aaa gag ctt caa gac ctt aag get tta aaa ttg tct caa 735 
10 Leu Gin Lys Glu Leu Gin Asp Leu Lys Ala Leu Lys Leu Ser Gin 

205 210 215 

ccg ttt tac atg cac atg ccg gcg gcg act ttg act atg tgc cct tct 783 
Pro Phe Tyr Met His Met Pro Ala Ala Thr Leu Thr Met Cys Pro Ser 
220 225 230 

tgt gag aga etc ggc ggt ggt ggt gtc gga gga gat acg acg gcg gtt 831 
Cys Glu Arg Leu Gly Gly Gly Gly Val Gly Gly Asp Thr Thr Ala Val 
235 " 240 245 250 

gat gaa gaa acg gcg aaa gga get ttc tec ate gtc aca aag cct cgt 879 
Asp Glu Glu Thr Ala Lys Gly Ala Phe Ser lie Val Thr Lys Pro Arg 
20 2 5 5 2 6 0 2 6 5 

ttc tat aac cct ttc act aat cct tct gca gca tgt tag ttacttatta 928 
Phe Tyr Asn Pro Phe Thr Asn Pro Ser Ala Ala Cys 
270 275 

at-i-at-t-t-aat- i 

25 



gttatttaat 


tctttttgtt 


ggtttttttt 


ttgtttctta 


aatcaaatta 


ggaattagtt 


988 


agaagataaa 


tcccagggaa 


aaaatattac gttgaaattg 


gggggaaatg 


gggtatagtc 


1048 


tttatagata 


agactcttca 


acgattccac 


tttatttttc 


ggtgggattg 


ttggttgatg 


1108 


aagaaaaaaa 


aatagtttgt 


aattacaggt 


ttaaatatgt 


agagaaaaaa 


tgacgaatat 


1168 


gtattatctt 


gttttttttt 


ccttcgaata 


tgtattaegg 


taatataaat 


ttgcttgtaa 


1228 


aaataataaa 


tatattattt 


g 








1249 



35 <210>34 
<211>278 



<212> PRT 

<213> Arabidopsis thaliana 



40 <400> 34 



45 



50 



55 
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Met Gly Leu Asp Asp Ser Cys Asn Thr Gly Leu Val Leu Gly Leu Gly 
15 10 15 

Leu Ser Pro Thr Pro Asn Asn Tyr Asn His Ala lie Lys Lys Ser Ser 
20 25 30 

Ser Thr Val Asp His Arg Phe lie Arg Leu Asp Pro Ser Leu Thr Leu 
35 40 45 

Ser Leu Ser Gly Glu Ser Tyr Lys lie Lys Thr Gly Ala Gly Ala Gly 
50 55 60 

Asp Gin lie Cys Arg Gin Thr Ser Ser His Ser Gly He Ser Ser Phe 
65 70 75 80 

Ser Ser Gly Arg Val Lys Arg Glu Arg Glu He Ser Gly Gly Asp Gly 
85 90 ~ 95 

Glu Glu Glu Ala Glu Glu Thr Thr Glu Arg Val Val Cys Ser Arg Val 



100 105 110 

Ser Asp Asp His Asp Asp Glu Glu Gly Val Ser Ala Arg Lys Lys Leu 
115 120 125 

Arg Leu Thr Lys Gin Gin Ser Ala Leu Leu Glu Asp Asn Phe Lys Leu 
130 135 140 

His Ser Thr Leu Asn Pro Lys Gin Lys Gin Ala Leu Ala Arg Gin Leu 
145 150 155 160 

Asn Leu Arg Pro Arg Gin Val Glu Val Trp Phe Gin Asn Arg Arg Ala 
165 170 175 

Arg Thr Lys Leu Lys Gin Thr Glu Val Asp Cys Glu Phe Leu Lys Lys 
180 185 " 190 

Cys Cys Glu Thr Leu Thr Asp Glu Asn Arg Arg Leu Gin Lys Glu Leu 
195 200 ~ ' 205 

Gin Asp Leu Lys Ala Leu Lys Leu Ser Gin Pro Phe Tyr Met His Met 
210 215 220 

Pro Ala Ala Thr Leu Thr Met Cys Pro Ser Cys Glu Arg Leu Gly Gly 
225 230 235 240 

Gly Gly Val Gly Gly Asp Thr Thr Ala Val Asp Glu Glu Thr Ala Lys 
245 250 255 

Gly Ala Phe Ser He Val Thr Lys Pro Arg Phe Tyr Asn Pro Phe Thr 
260 265 270 

Asn Pro Ser Ala Ala Cys 
275 
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<210> 35 
<211> 1147 
<212> DNA 

<213> Arabidopsis thaliana 

5 

<220> 
<221> CDS 
<222> (33). .(695) 
<223> G489 

10 

<400> 35 

tggatcaaca agaccatgga cagtctggag ct atg aac tat ggc aca aac cca 53 

Met Asn Tyr Gly Thr Asn Pro 

15 1 5 

tac caa acc aac ccg atg age acc act get get act gta gca gga ggt 101 
Tyr Gin Thr Asn Pro Met Ser Thr Thr Ala Ala Thr Val Ala Gly Gly 
10 15 20 

20 9 C 9 gca caa cca ggc cag ctg gcg ttc cac cag ate cat cag cag cag 14 9 

Ala Ala Gin Pro Gly Gin Leu Ala Phe His Gin He His Gin Gin Gin 
25 30 35 

cag cag caa cag ctg gca cag cag ctt caa gca ttt tgg gag aac caa 197 

Gin Gin Gin Gin Leu Ala Gin Gin Leu Gin Ala Phe Trp Glu Asn Gin 
40 45 50 55 

25 



35 



40 



45 



50 



55 
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10 



ttc aaa gag att gag aag act acc gat ttc aag aac cac age ctt ccc 24 5 

Phe Lys Glu lie Glu Lys Thr Thr Asp Phe Lys Asn His Ser Leu Pro 
60 65 70 

ctt gcg aga ate aag aaa ate atg aaa gcg gat gaa gat gtc cgt atg 293 

Leu Ala Arg lie Lya Lys lie Met Lys Ala Asp Glu Asp Val Arg Met 

75 80 85 

ate teg get gag gcg ccg gtc gtg ttt gca agg gec tgt gag atg ttc 341 

lie Ser Ala Glu Ala Pro Val Val Phe Ala Arg Ala Cys Glu Met Phe 

90 95 100 

ate ctg gag ctg aca etc agg teg tgg aac cac act gag gag aat aag 389 

lie Leu Glu Leu Thr Leu Arg Ser Trp Asn His Thr Glu Glu Asn Lys 

105 110 115 

agg egg acg ttg cag aag aac gat att get get get gtg act aga acc 43 7 

Arg Arg Thr Leu Gin Lys Asn Asp lie Ala Ala Ala Val Thr Arg Thr 

120 ~ 125 130 135 

gat att ttt gat ttc ctt gtg gat att gtt ccc egg gag gat etc cga 485 

Asp lie Phe Asp Phe Leu Val Asp lie Val Pro Arg Glu Asp Leu Arg 
14 0 14 5 150 

gat gaa gtc ttg gga agt att ccg agg ggc act gtc ccg gaa get get 53 3 

Asp Glu Val Leu Gly Ser lie Pro Arg Gly Thr Val Pro Glu Ala Ala 

155 160 165 

get get ggt tac ccg tat gga tac ttg cct gca gga act get cca ata 581 

Ala Ala Gly Tyr Pro Tyr Gly Tyr Leu Pro Ala Gly Thr Ala Pro He 

25 170 ** 175 180 

gga aat ccg gga atg gtt atg ggt aat ccc ggt ggt gcg tat cca cct 629 

Gly Asn Pro Gly Met Val Met Gly Asn Pro Gly Gly Ala Tyr Pro Pro 

185 190 195 

aat cct tat atg ggt caa cca atg tgg caa caa cag gca cct gac caa 677 

30 Asn Pro Tyr Met Gly Gin Pro Met Trp Gin Gin Gin Ala Pro Asp Gin 

200 " 205 210 215 

cct gac cag gaa aat tag caagaaactg tgagtcttcc agettcgegg 72 5 

Pro Asp Gin Glu Asn 
220 

35 



20 



40 



45 



ccgctctaga 


caggcctcgt 


accggatcct 


ctagctagag 


etttegtteg 


tatcateggt 


785 


ttcgacaacg 


ttcgtcaagt 


teaatgeate 


agtttcattg 


cgcacacacc 


agaatcctac 


845 


tgagtttgag 


tattatggca 


ttgggaaaac 


tgtttttctt 


gtccatttgt 


tgtgcttgta 


905 


atttactgtg 


ttttttattc 


ggttttcgct 


atcgaactgt 


gaaatggaaa 


tggatggaga 


965 


agagttaatg 


aatgatatgg 


ccttttgttc 


attctcaaat 


taatattatt 


tggtttttct 


1025 


cttatttgtg 


gggatgaatt 


tgaaattata 


agagatatgc 


aaacattttg 


tttgagtaaa 


1085 


atgtgcaaat 


cgtggcctct 


aatgacccga 


agttaatatg 


aggagtaaaa 


cacttgtagg 


1145 


tg 
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<210> 36 
50 <211>220 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 36 

55 



78 
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Met Asn Tyr Gly Thr Asn Pro Tyr Gin Thr Asn Pro Met Ser Thr Thr 
15 10 15 

Ala Ala Thr Val Ala Gly Gly Ala Ala Gin Pro Gly Gin Leu Ala Phe 
20 25 30 



His Gin lie His Gin Gin Gin Gin Gin Gin Gin Leu Ala Gin Gin Leu 
35 40 45 

Gin Ala Phe Trp Glu Asn Gin Phe Lys Glu lie Glu Lys Thr Thr Asp 
50 55 60 

Phe Lys Asn His Ser Leu Pro Leu Ala Arg lie Lys Lys lie Met Lys 
65 70 75 BO 

Ala Asp Glu Asp Val Arg Met lie Ser Ala Glu Ala Pro Val Val Phe 
85 90 95 

Ala Arg Ala Cys Glu Met Phe lie Leu Glu Leu Thr Leu Arg Ser Trp 
100 105 110 

Asn His Thr Glu Glu Asn Lys Arg Arg Thr Leu Gin Lys Asn Asp lie 
115 120 125 

Ala Ala Ala Val Thr Arg Thr Asp lie Phe Asp Phe Leu Val Asp lie 
130 135 140 

Val Pro Arg Glu Asp Leu Arg Asp Glu Val Leu Gly Ser lie Pro Arg 
145 150 155 160 

Gly Thr Val Pro Glu Ala Ala Ala Ala Gly Tyr Pro Tyr Gly Tyr Leu 
165 170 175 

Pro Ala Gly Thr Ala Pro lie Gly Asn Pro Gly Met Val Met Gly Asn 
180 185 190 

Pro Gly Gly Ala Tyr Pro Pro Asn Pro Tyr Met Gly Gin Pro Met Trp 
195 200 205 

Gin Gin Gin Ala Pro Asp Gin Pro Asp Gin Glu Asn 
210 ~ 215 220 



<210> 37 
<211> 1262 
<212> DNA 

<213> Arabidopsis thaliana 

<220> 
<221>CDS 
<222> (21 7). .(957) 
<223> G463 

<400> 37 
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ctcgagccac gtcaggggtc tctttctgtt tgtttgtttt cttgtttcct tctctctctc 

tctttctttc tttgtcttcc tttcccaggt tgtttttttt tgctctctct gccttcttga 

ctttcaaaag actctttctt tcttttggat tgattttgga ttctagggct ctctttcttt 

tagtgggttt ttgttgttgt tgttgtggtc tctctg atg att act gaa ctt gag 

Met He Thr Glu Leu Glu 
1 5 

atg ggg aaa ggt gag agt gag ctt gag ctt ggt eta ggg ctg agt ctt 

Met Gly Lys Gly Glu Ser Glu Leu Glu Leu Gly Leu Gly Leu Ser Leu 
10 15 20 



80 
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10 



20 



45 



50 



ggc ggt gga acg gcg gcc aag att ggt aaa tea ggt ggt ggt ggc gcg 330 

Gly Gly Gly Thr Ala Ala Lys He Gly Lys Ser Gly Gly Gly Gly Ala 

25 30 35 

tgg gga gag cgt gga agg ctt ttg acg get aag gat ttt cct tct gtt 3 78 

Trp Gly Glu Arg Gly Arg Leu Leu Thr Ala Lys Asp Phe Pro Ser Val 
40 45 50 

ggt tct aaa cgt get get gat tct get tct cat get ggt tea tct cct 426 

Gly Ser Lys Arg Ala Ala Asp Ser Ala Ser His Ala Gly Ser Ser Pro 

55 "* 60 65 70 

cct cgt tea agt caa gtt gtt gga tgg cct cct ata ggg tea cac agg 4 74 

Pro Arg Ser Ser Gin Val Val Gly Trp Pro Pro He Gly Ser His Arg 

75 80 85 

atg aac agt ttg gtt aat aac caa get aca aag tea gca aga gaa gaa 522 

Met Asn Ser Leu Val Asn Asn Gin Ala Thr Lys Ser Ala Arg Glu Glu 

90 95 100 

gaa gaa get ggt aag aag aaa gtg aaa gat gat gaa cct aaa gat gtg 570 

Glu Glu Ala Gly Lys Lys Lys Val Lys Asp Asp Glu Pro Lys Asp Val 

105 * 110 115 



aca aag aaa gtg aat ggg aaa gta caa gtt gga ttt att aag gtg aac 
Thr Lys Lys Val Asn Gly Lys Val Gin Val Gly Phe He Lys Val Asn 
120 125 130 



55 <210>38 
<211>246 



<212> PRT 

<213> Arabidopsis thaliana 



618 



atg gat gga gtt get ata gga aga aaa gtg gat ttg aat get cat tct 666 
Met Asp Gly Val Ala He Gly Arg Lys Val Asp Leu Asn Ala His Ser 
25 135 140 * 145 150 

tct tac gag aat ttg gcg caa aca ttg gaa gat atg ttc ttt cgc act 714 
Ser Tyr Glu Asn Leu Ala Gin Thr Leu Glu Asp Met Phe Phe Arg Thr 
155 160 165 

aat ccg ggt act gtc ggg tta ace agt cag ttc act aaa ccg ttg agg 762 
30 Asn Pro Gly Thr Val Gly Leu Thr Ser Gin Phe Thr Lys Pro Leu Arg 

170 175 180 

ctt tta gat gga teg tct gag ttt gta ctt act tat gaa gat aag gaa 810 

Leu Leu Asp Gly Ser Ser Glu Phe Val Leu Thr Tyr Glu Asp Lys Glu 
185 190 195 

35 

gga gat tgg atg ctt gtt ggt gat gtt cca tgg aga atg ttc ate aac 858 

Gly Asp Trp Met Leu Val Gly Asp Val Pro Trp Arg Met Phe lie Asn 
200 205 210 

teg gtg aaa agg eta cgt gtg atg aaa acc tct gaa get aat gga etc 906 
Ser Val Lys Arg Leu Arg Val Met Lys Thr Ser Glu Ala Asn Gly Leu 
40 215 220 225 230 

get gca cga aat caa gaa cca aac gag aga cag cga aag cag ccg gtt 954 
Ala Ala Arg Asn Gin Glu Pro Asn Glu Arg Gin Arg Lys Gin Pro Val 
235 240 245 



tag atctcttttc gaegttaegg tgttacaggt tttatatttt ggggttttgc 1007 

aagtctgaga tacttctgaa gcaagcataa gctagattga tcttatatcc agtttgtgta 1067 

ttttcttggt tcttataatg gtttttactg gttttcttta gttttttttt ttgctgtctt 1127 

ttaattttcg gttgcgattt cactatatac tatggatgga agagaatget ctttatatct 1187 

tttactacac tgtaaatatt tgaagcttat etaatategt ttttaagggt taaaaaaccc 1247 

tgaegtagee tcgag 1262 
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<400> 38 



Met lie Thr Glu Leu Glu Met Gly Lys Gly Glu Ser Glu Leu Glu Leu 
15 10 15 

Gly Leu Gly Leu Ser Leu Gly Gly Gly Thr Ala Ala Lye lie Gly Lys 
20 25 30 

Ser Gly Gly Gly Gly Ala Trp Gly Glu Arg Gly Arg Leu Leu Thr Ala 
35 40 45 

Lys Asp Phe Pro Ser Val Gly Ser Lys Arg Ala Ala Asp Ser Ala Ser 
50 55 60 

His Ala Gly Ser Ser Pro Pro Arg Ser Ser Gin Val Val Gly Trp Pro 
65 ~ 70 75 80 

Pro lie Gly Ser His Arg Met Asn Ser Leu Val Asn Asn Gin Ala Thr 
85 90 95 

Lys Ser Ala Arg Glu Glu Glu Glu Ala Gly Lys Lys Lys Val Lys Asp 
100 105 110 

Asp Glu Pro Lys Asp Val Thr Lys Lys Val Asn Gly Lys Val Gin Val 
115 120 125 

Gly Phe He Lys Val Asn Met Asp Gly Val Ala He Gly Arg Lys Val 
130 135 140 

Asp Leu Asn Ala His Ser Ser Tyr Glu Asn Leu Ala Gin Thr Leu Glu 
145 150 155 160 

Asp Met Phe Phe Arg Thr Asn Pro Gly Thr Val Gly Leu Thr Ser Gin 
165 170 175 

Phe Thr Lys Pro Leu Arg Leu Leu Asp Gly Ser Ser Glu Phe Val Leu 
180 185 190 

Thr Tyr Glu Asp Lys Glu Gly Asp Trp Met Leu val Gly Asp Val Pro 
195 200 205 

Trp Arg Met Phe He Asn Ser Val Lys Arg Leu Arg Val Met Lys Thr 
210 215 220 

Ser Glu Ala Asn Gly Leu Ala Ala Arg Asn Gin Glu Pro Asn Glu Arg 
225 230 235 240 

Gin Arg Lys Gin Pro Val 
245 



<210> 39 
<211> 905 
<212> DNA 

<213> Arabidopsis thaliana 
<220> 
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<221>CDS 
<222> (76).. (837) 
<223> G767 

5 <400> 39 



10 



15 



20 



25 



30 



35 



40 



45 



50 



83 



EP 1 230 345 B1 



cgaagtttca actttcaaac atatctttac agttctttct tgctaaacaa caataaaggg 60 

gaaaataggt taatt atg atg aaa tct ggg get gat ttg caa ttt cca cca 111 
Met Met Lys Ser Gly Ala Asp Leu Gin Phe Pro Pro 

5 1 5 10 

gga ttt aga ttt cat cct acg gat gag gag eta gtc etc atg tat etc 159 
Gly Phe Arg Phe His Pro Thr Asp Glu Glu Leu Val Leu Met Tyr Leu 
15 20 25 

tgt cgt aaa tgc gcg teg cag ccg ate cct get ccg att ate ace gaa 207 
10 cys Arg Lys Cys Ala Ser Gin Pro He Pro Ala Pro He He Thr Glu 

30 35 40 

etc gat ttg tac cga tat gat cct tgg gac ctt ccc gac atg get ttg 255 
Leu Asp Leu Tyr Arg Tyr Asp Pro Trp Asp Leu Pro Asp Met Ala Leu 
45 * " 50 55 60 

tac ggt gaa aag gag tgg tat ttt ttc tea cca aga gat cga aag tat 3 03 

Tyr Gly Glu Lys Glu Trp Tyr Phe Phe Ser Pro Arg Asp Arg Lys Tyr 
65 70 75 

cca aac ggt tea aga ccc aac cgt gca get ggt act gga tat tgg aaa 3 51 

Pro Asn Gly Ser Arg Pro Asn Arg Ala Ala Gly Thr Gly Tyr Trp Lys 
20 80 85 90 

get acc gga get gat aaa cca ata ggt cgt cct aaa ccg gtt ggt att 3 99 

Ala Thr Gly Ala Asp Lys Pro He Gly Arg Pro Lys Pro Val Gly He 
95 100 105 



25 



aag aag get eta gtg ttt tac teg gga aaa cct cca aat gga gag aaa 447 

Lys Lys Ala Leu Val Phe Tyr Ser Gly Lys Pro Pro Asn Gly Glu Lys 

110 115 120 

acc aat tgg att atg cac gaa tac egg etc'get gac gtt gac egg teg 495 

Thr Asn Trp He Met His Glu Tyr Arg Leu Ala Asp Val Asp Arg Ser 

125 130 135 140 

30 

gtt cgt aag aaa aac agt eta aga ttg gac gat tgg gta ttg tgt cgt 543 

Val Arg Lys Lys Asn Ser Leu Arg Leu Asp Asp Trp Val Leu Cys Arg 
145 150 155 

ata tat aac aag aaa ggt gtc ate gag aag cga cga age gat ate gag 591 

He Tyr Asn Lys Lys Gly Val He Glu Lys Arg Arg Ser Asp He Glu 

35 160 165 170 

gac ggg tta aag cct gtg act gac acg tgt cca ccg gaa tct gtg gcg 639 

Asp Gly Leu Lys Pro Val Thr Asp Thr Cys Pro Pro Glu Ser Val Ala 

175 180 185 



40 



45 



aga ttg ate tec ggc teg gag caa gcg gtg tea ccg gaa ttc acg tgt 687 

Arg Leu lie Ser Gly Ser Glu Gin Ala Val Ser Pro Glu Phe Thr Cys 
190 195 200 

age aac ggt egg ttg agt aat gee ctt gat ttt ccg ttt aat tac gta 735 
Ser Asn Gly Arg Leu Ser Asn Ala Leu Asp Phe Pro Phe Asn Tyr Val 
205 ~ 210 215 220 

gat gec ate gec gat aac gag att gtg tea egg eta ttg ggc ggg aat 783 

Asp Ala He Ala Asp Asn Glu He Val Ser Arg Leu Leu Gly. Gly Asn 
225 230 235 

cag atg tgg teg acg acg ctt gat cca ctt gtg gtt agg cag gga act 831 

Gin Met Trp Ser Thr Thr Leu Asp Pro Leu Val Val Arg Gin Gly Thr 
240 245 250 

ttc tga gttgtcacgt gcgattagag ttagtggaaa gtggaaacta tcactgtctg 887 
Phe 

55 ttttcgcacg tgteggge 905 



<210> 40 



50 
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<211>253 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 40 



Met Met Lys Ser Gly Ala Asp Leu Gin Phe Pro Pro Gly Phe Arg Phe 
1^5 10 15 

His Pro Thr Asp Glu Glu Leu Val Leu Met Tyr Leu Cys Arg Lys Cys 
20 25 3 0 



Ala Ser Gin Pro lie Pro Ala Pro lie lie Thr Glu Leu Asp Leu Tyr 
35 40 45 

Arg Tyr Asp Pro Trp Asp Leu Pro Asp Met Ala Leu Tyr Gly Glu Lys 
50 55 60 

Glu Trp Tyr Phe Phe Ser Pro Arg Asp Arg Lys Tyr Pro Asn Gly Ser 
65 70 75 80 

Arg Pro Asn Arg Ala Ala Gly Thr Gly Tyr Trp Lys Ala Thr Gly Ala 
85 90 95 

Asp Lys Pro lie Gly Arg Pro Lys Pro Val Gly lie Lys Lys Ala Leu 
100 105 110 

Val Phe Tyr Ser Gly Lys Pro Pro Asn Gly Glu Lys Thr Asn Trp He 
115 120 125 

Met His Glu Tyr Arg Leu Ala Asp Val Asp Arg Ser Val Arg Lys Lys 
130 ~ 135 140 

Asn Ser Leu Arg Leu Asp Asp Trp Val Leu Cys Arg He Tyr Asn Lys 
145 150 155 160 

Lys Gly Val He Glu Lys Arg Arg Ser Asp He Glu Asp Gly Leu Lys 
165 170 175 

Pro Val Thr Asp Thr Cys Pro Pro Glu Ser Val Ala Arg Leu He Ser 
180 185 190 

Gly Ser Glu Gin Ala Val Ser Pro Glu Phe Thr Cys Ser Asn Gly Arg 
195 200 205 

Leu Ser Asn Ala Leu Asp Phe Pro Phe Asn Tyr Val Asp Ala He Ala 
210 ' 215 220 

Asp Asn Glu He Val Ser Arg Leu Leu Gly Gly Asn Gin Met Trp Ser 
225 230 235 240 

Thr Thr Leu Asp Pro Leu Val Val Arg Gin Gly Thr Phe 
245 250 



<210>41 
<211> 1479 
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<212> DNA 

<213> Arabidopsis thaliana 

<220> 
5 <221>CDS 

<222> (192). .(962) 
<223> G765 

<400> 41 

10 
15 
20 
25 
30 
35 
40 
45 
50 
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tttaactgca tagttcttcc tcaaattgct catacctttg cttctctttc tttctttatt 60 

caacctctta tttcttcttc agcaccaacg tgtatctgca gtaaacacat atccatcgat 120 

cgatctcctc aaaaagttat tgttttcttg aaggattttt cttgttcttg atcaagcata 180 

catatatata g atg gtg gaa gaa ggc ggc gta gtt gtg aat caa gga gga 230 
Met Val Glu Glu Gly Gly Val Val Val Asn Gin Gly Gly 
15 10 

gat cag gag gtg gtg gat ttg cct ccg ggg ttt egg ttt cac cct act 278 
Asp Gin Glu Val Val Asp Leu Pro Pro Gly Phe Arg Phe His Pro Thr 
15 20 25 

gat gaa gag ata ata act cac tac etc aaa gag aag gtc ttc aac ate 326 

Asp Glu Glu He He Thr His Tyr Leu Lys Glu Lys Val Phe Asn He 

30 35 40 45 

15 

cga ttt acc gcg gca gcg att ggt caa gec gac etc aac aag aac gag 374 

Arg Phe Thr Ala Ala Ala He Gly Gin Ala Asp Leu Asn Lys Asn Glu 
50 55 60 

cca tgg gat eta cca aag att gca aag atg ggg gag aag gag ttt tac 422 
Pro Trp Asp Leu Pro Lys He Ala Lys Met Gly Glu Lys Glu Phe Tyr 
20 65 70 75 

ttt ttc tgc cag agg gat egg aag tat ccg acc ggg atg agg acg aac 470 
Phe Phe Cys Gin Arg Asp Arg Lys Tyr Pro Thr Gly Met Arg Thr Asn 
80 85 90 



25 



cgt gcg acc gtg tct ggt tat tgg aag gcg acc ggg aag gac aag gag 518 
Arg Ala Thr Val Ser Gly Tyr Trp Lys Ala Thr Gly Lys Asp Lys Glu 
95 100 105 

ate ttt aga ggc aaa ggt tgt ctt gtt ggg atg aag aaa aca ctt gtt 566 
He Phe Arg Gly Lys Gly Cys Leu Val Gly Met Lys Lys Thr Leu Val 
HO 115 120 125 

ttc tat aca gga aga get cct aaa ggt gaa aag acc aat tgg gtt atg 614 
Phe Tyr Thr Gly Arg Ala Pro Lys Gly Glu Lys Thr Asn Trp Val Met 
130 135 140 

cat gaa tat cgt ctt gat gga aaa tat tct tat cat aac etc ccc aaa 662 
His Glu Tyr Arg Leu Asp Gly Lys Tyr Ser Tyr His Asn Leu Pro Lys 
145 150 155 

acc gca agg gat gaa tgg gtg gtg tgt agg gtt ttt cac aag aac get 710 
Thr Ala Arg Asp Glu Trp Val Val Cys Arg Val Phe His Lys Asn Ala 
160 165 170 

40 cct agt act aca ate act act aca aaa caa etc tea agg att gat tct 758 

Pro Ser Thr Thr He Thr Thr Thr Lys Gin Leu Ser Arg He Asp Ser 
175 180 185 

ctt gat aac att gat cat etc tta gac ttc tea tct etc cct cct etc 806 
Leu Asp Asn He Asp His Leu Leu Asp Phe Ser Ser Leu Pro Pro Leu 
190 195 200 205 



35 



45 



50 



55 



ata gat ccg ggt ttc ttg ggt caa ccc gee caa get tct ccg gtg ccc 854 

He Asp Pro Gly Phe Leu Gly Gin Pro Ala Gin Ala Ser Pro Val Pro 
210 215 220 

gtc aac aac acg ate tea aac ctg tct cca cca tec tac aac cgc acc 902 

Val Asn Asn Thr He Ser Asn Leu Ser Pro Pro Ser Tyr Asn Arg Thr 
225 230 235 

agt cga caa cac tta cct tec tac cca age tct caa ttt ccc tta cca 950 
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Ser Arg Gin His Leu Pro Ser Tyr Pro Ser Ser Gin Pne Pro Leu Pro 
240 245 250 

etc ggt ccc taa ttceggatet gatttegget aeggggcagg ttcaggcaat 
Leu Gly Pro 
255 

aataacaaag gtatgatcaa gttggagcat tctcttgtga gtgtgtctca agaaaceggt 

ttgagttccg atgtgaacac aaccgcaacg ccagagatat cttcttatcc aatgatgatg 

aatccggcaa tgatggatgg tagcaagtca gcgtgtgatg gtcttgatga cttgatcttc 

tgggaagatt tatatactag ctaaatttgg gaaaaggtta tttgttaatt gtgattgaag 

agtggcatat tgattactcg tctagtgttt ttaatcgtgt aattagttcg tatataatat 

acatgtacat aagatcatta ggtttattag gcattggact ttagttcggt gattgettae 

ctagttttta gcttgagaaa aaaggctgtc attggggtta tgtttctttg tgattaactt 

gtacatatat acatttaaat taaacgtatg gtttaaatcg tttaaaaaaa aaaaaaa 



<210> 42 
<211>256 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 42 
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Met Val Glu Glu Gly Gly Val Val Val Asn Gin Gly Gly Asp Gin Glu 
15 10 15 

Val Val Asp Leu Pro Pro Gly Phe Arg Phe His Pro Thr Asp Glu Glu 
20 25 30 

lie lie Thr His Tyr Leu Lys Glu Lys Val Phe Asn lie Arg Phe Thr 
35 40 45 

Ala Ala Ala He Gly Gin Ala Asp Leu Asn Lys Asn Glu Pro Trp Asp 
50 55 60 

Leu Pro Lys He Ala Lys Met Gly Glu Lys Glu Phe Tyr Phe Phe Cys 
65 70 75 80 

Gin Arg Asp Arg Lys Tyr Pro Thr Gly Met Arg Thr Asn Arg Ala Thr 
85 90 95 

Val Ser Gly Tyr Trp Lys Ala Thr Gly Lys Asp Lys Glu He Phe Arg 
100 105 110 

Gly Lys Gly Cys Leu Val Gly Met Lys Lys Thr Leu Val Phe Tyr Thr 
115 120 125 

Gly Arg Ala Pro Lys Gly Glu Lys Thr Asn Trp Val Met His Glu Tyr 
130 135 140 

Arg Leu Asp Gly Lys Tyr Ser Tyr His Asn Leu Pro Lys Thr Ala Arg 
145 150 155 160 

Asp Glu Trp Val Val Cys Arg Val Phe His Lys Asn Ala Pro Ser Thr 
165 170 175 



Thr He Thr Thr Thr Lys Gin Leu Ser Arg He Asp Ser Leu Asp Asn 
180 185 190 

He Asp His Leu Leu Asp Phe Ser Ser Leu Pro Pro Leu He Asp Pro 
195 200 205 

Gly Phe Leu Gly Gin Pro Ala Gin Ala Ser Pro Val Pro Val Asn Asn 
210 215 220 

Thr He Ser Asn Leu Ser Pro Pro Ser Tyr Asn Arg Thr Ser Arg Gin 
225 230 235 240 

His Leu Pro Ser Tyr Pro Ser Ser Gin Phe Pro Leu Pro Leu Gly Pro 
245 250 255 



<210> 43 
<211> 825 
<212> DNA 

<213> Arabidopsis thaliana 
<220> 
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<221>CDS 
<222> (1)..(825) 
<223>G197 

5 <400> 43 

atg gga agg tct cct tgc tgt gag aaa gac cac aca aac aaa gga get 48 
Met Gly Arg Ser Pro Cys Cys Glu Lys Asp His Thr Asn Lys Gly Ala 
15 10 15 

10 

tgg act aag gaa gaa gac gat aag etc ate tct tac ate aaa get cac 96 
Trp Thr Lys Glu Glu Asp Asp Lys Leu lie Ser Tyr He Lys Ala His 
20 25 ' 30 

ggt gaa ggt tgt tgg cgt tct ctt cct aga tec gee ggt ctt caa cgt 144 
Gly Glu Gly Cys Trp Arg Ser Leu Pro Arg Ser Ala Gly Leu Gin Arg 
15 35 40 45 

tgc gga aaa age tgt cgt etc cga tgg att aac tat etc cga cct gat 192 
Cys Gly Lys Ser Cys Arg Leu Arg Trp lie Asn Tyr Leu Arg Pro Asp 
50 55 60 

20 etc aag agg ggt aac ttc ace etc gaa gaa gat gat etc ate ate aaa 240 

Leu Lys Arg Gly Asn Phe Thr Leu Glu Glu Asp Asp Leu He He Lys 
65 70 75 80 

eta cat age ctt etc ggt aac aag tgg tct ctt att gcg acg aga tta 2B8 

Leu His Ser Leu Leu Gly Asn Lys Trp Ser Leu He Ala Thr Arg Leu 

85 90 95 

25 

cca gga aga aca gat aac gag att aag aat tac tgg aac aca cat gtt 336 

Pro Gly Arg Thr Asp Asn Glu He Lys Asn Tyr Trp Asn Thr His Val 
100 105 110 

aag agg aag eta tta aga aaa ggg att gat ccg gcg act cat cga cct 3 84 

qn Lys Arg Lys Leu Leu Arg Lys Gly He Asp Pro Ala Thr His Arg Pro 

115 120 125 

ate aac gag ace aaa act tct caa gat teg tct gat tct agt aaa aca 432 
lie Asn Glu Thr Lys Thr Ser Gin Asp Ser Ser Asp Ser Ser Lys Thr 
130 135 ~ 140 

35 gag gac cct ctt gtc aag att etc tct ttt ggt cct cag ctg gag aaa 480 

Glu Asp Pro Leu Val Lys He Leu Ser Phe Gly Pro Gin Leu Glu Lys 
145 150 155 160 

ata gca aat ttc ggg gac gag aga att caa aag aga gtt gag tac tea 5 28 

He Ala Asn Phe Gly Asp Glu Arg He Gin Lys Arg Val Glu Tyr Ser 
165 170 175 

40 



45 



50 



55 
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gtt gtt gaa gaa aga tgt ctg gac ttg aat ctt gag ctt agg ate agt 576 

Val Val Glu Glu Arg Cys Leu Asp Leu Asn Leu Glu Leu Arg lie Ser 

180 185 190 

cca cca tgg caa gac aag etc cat gat gag agg aac eta agg ttt ggg 624 

Pro Pro Trp Gin Asp Lys Leu His Asp Glu Arg Asn Leu Arg Phe Gly 

195 200 205 

aga gtg aag tat agg tgc agt gcg tgc cgt ttt gga ttc ggg aac ggc 672 

Arg Val Lys Tyr Arg Cys Ser Ala Cys Arg Phe Gly Phe Gly Asn Gly 

210 215 220 

aag gag tgt age tgt aat aat gtg aaa tgt caa aca gag gac agt agt 720 

Lys Glu Cys Ser Cys Asn Asn Val Lys Cys Gin Thr Glu Asp Ser Ser 

225 230 235 240 

age age agt tat tct tea acc gac att agt agt age att ggt tat gac 768 

15 Ser Ser Ser Tyr Ser Ser Thr Asp lie Ser Ser Ser He Gly Tyr Asp 

245 250 255 

ttc ttg ggt eta aac aac act agg gtt ttg gat ttt age act ttg gaa 816 

Phe Leu Gly Leu Asn Asn Thr Arg Val Leu Asp Phe Ser Thr Leu Glu 

260 265 270 

20 atg aaa tga 825 

Met Lys 



<210> 44 
25 <211>274 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 44 



35 



40 



45 



50 



55 
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Met Gly Arg Ser Pro Cys Cys Glu Lys Asp His Thr Asn Lys Gly Ala 
15 10 15 

Trp Thr Lys Glu Glu Asp Asp Lys Leu He Ser Tyr He Lys Ala His 
20 25 30 

Gly Glu Gly Cys Trp Arg Ser Leu Pro Arg Ser Ala Gly Leu Gin Arg 
35 40 4 5 

Cys Gly Lys Ser Cys Arg Leu Arg Trp lie Asn Tyr Leu Arg Pro Asp 
50 55 60 

Leu Lys Arg Gly Asn Phe Thr Leu Glu Glu Asp Asp Leu He He Lys 
65 70 75 80 

Leu His Ser Leu Leu Gly Asn Lys Trp Ser Leu He Ala Thr Arg Leu 
85 90 95 

Pro Gly Arg Thr Asp Asn Glu He Lys Asn Tyr Trp Asn Thr His Val 
100 105 110 

Lys Arg Lys Leu Leu Arg Lys Gly He Asp Pro Ala Thr His Arg Pro 
115 120 125 

He Asn Glu Thr Lys Thr Ser Gin Asp Ser Ser Asp Ser Ser Lys Thr 
130 135 140 

Glu Asp Pro Leu Val Lys He Leu Ser Phe Gly Pro Gin Leu Glu Lys 
145 150 155 160 



He Ala Asn Phe Gly Asp Glu Arg He Gin Lys Arg Val Glu Tyr Ser 
165 170 175 

Val Val Glu Glu Arg Cys Leu Asp Leu Asn Leu Glu Leu Arg He Ser 
180 185 190 

Pro Pro Trp Gin Asp Lys Leu His Asp Glu Arg Asn Leu Arg Phe Gly 
195 200 205 

Arg Val Lys Tyr Arg Cys Ser Ala Cys Arg Phe Gly Phe Gly Asn Gly 
210 215 220 

Lys Glu Cys Ser Cys Asn Asn Val Lys Cys Gin Thr Glu Asp Ser Ser 
225 " 230 235 240 

Ser Ser Ser Tyr Ser Ser Thr Asp He Ser Ser Ser He Gly Tyr Asp 
245 250 255 

Phe Leu Gly Leu Asn Asn Thr Arg Val Leu Asp Phe Ser Thr Leu Glu 
260 265 270 



Met Lys 
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<211> 918 
<212> DNA 

<213> Arabidopsis thalina 

5 <220> 

<221>CDS 
<222> (30). .(839) 
<223> G255 

w <400> 45 

agcatcatca tcatcagaag aagagagtc atg gga aga tct cct tgc tgc gag 53 

Met Gly Arg Ser Pro Cys Cys Glu 
1 5 

aaa gaa cac atg aac aaa ggt get tgg act aaa gaa gaa gat gag aga 101 
Lys Glu His Met Asn Lys Gly Ala Trp Thr Lys Glu Glu Asp Glu Arg 
10 15 20 

eta gtc tct tac ate aag tct cac ggt gaa ggt tgt tgg cga tct ctt 149 
Leu Val Ser Tyr lie Lys Ser His Gly Glu Gly Cys Trp Arg Ser Leu 
20 25 30 35 40 

cct aga gec get ggt etc ctt cgc tgc ggt aaa age tgc cgt ctt egg 197 
Pro Arg Ala Ala Gly Leu Leu Arg Cys Gly Lys Ser Cys Arg Leu Arg 
45 50 55 



25 



tgg att aac tat etc cga cct gat etc aaa aga gga aac ttt aca cat 245 

Trp lie Asn Tyr Leu Arg Pro Asp Leu Lys Arg Gly Asn Phe Thr His 
60 65 70 

gat gaa gat gaa ctt ate ate aag ctt cat age etc eta ggc aac aag 293 

Asp Glu Asp Glu Leu lie lie Lys Leu His Ser Leu Leu Gly Asn Lys 

75 80 85 

30 

tgg tct ttg att gcg gcg aga tta cct gga aga aca gat aac gag ate 341 

Trp Ser Leu lie Ala Ala Arg Leu Pro Gly Arg Thr Asp Asn Glu lie 

90 95 100 

aag aac tac tgg aac aca cat ata aag agg aag ctt ttg age aaa ggg 389 

35 



40 



45 



50 



55 



93 



EP 1 230 345 B1 



10 



Lys Asn Tyr Trp Asn Thr His lie Lys Arg Lys Leu Leu Ser Lys Gly 
105 110 115 120 

att gat cca gcc act cat aga ggg ate aac gag gca aaa att tct gat 437 
lie Asp Pro Ala Thr His Arg Gly lie Asn Glu Ala Lys He Ser Asp 
125 130 135 

ttg aag aaa aca aag gac caa att gta aaa gat gtt tct ttt gtg aca 485 
Leu Lys Lys Thr Lys Asp Gin He Val Lys Asp Val Ser Phe Val Thr 
140 ^ 145 150 

aag ttt gag gaa aca gac aag tct ggg gac cag aag caa aat aag tat 533 
Lys Phe Glu Glu Thr Asp Lys Ser Gly Asp Gin Lys Gin Asn Lys Tyr 
155 160 * ~ 165 

att cga aat ggg tta gtt tgc aaa gaa gag aga gtt gtt gtt gaa gaa 581 
He Arg Asn Gly Leu Val Cys Lys Glu Glu Arg Val Val Val Glu Glu 
15 170 175 180 

aaa ata ggc cca gat ttg aat ctt gag ctt agg ate agt cca cca tgg 629 
Lys He Gly Pro Asp Leu Asn Leu Glu Leu Arg He Ser Pro Pro Trp 
185 " 190 195 200 

20 caa aac cag aga gaa ata tct act tgc act gcg tec cgt ttt tac atg 677 

Gin Asn Gin Arg Glu He Ser Thr Cys Thr Ala Ser Arg Phe Tyr Met 
205 210 215 

gaa aac gac atg gag tgt agt agt gaa act gtg aaa tgt caa aca gag 725 
Glu Asn Asp Met Glu Cys Ser Ser Glu Thr Val Lys Cys Gin Thr Glu 
220 225 230 

25 

aat agt age age att age tat tct tct att gat att agt agt agt aac 773 
Asn Ser Ser Ser He Ser Tyr Ser Ser He Asp He Ser Ser Ser Asn 
235 240 245 

gtt ggt tat gac ttc ttg ggt ttg aag aca aga att ttg gat ttt cga 821 
30 Val Gly Tyr Asp Phe Leu Gly Leu Lys Thr Arg He Leu Asp Phe Arg 

250 255 260 

age ttg gaa atg aaa taa atgaatagta ttagattctt aatttgtagg 869 
Ser Leu Glu Met Lys 
26 5 

35 tctgataatg aatgttagat tcgcggccct ctagacaggc ctcgtaccg 918 



<210> 46 
<211>269 
40 <212>PRT 

<213> Arabidopsis thalina 

<400> 46 



50 



55 
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Met Gly Arg Ser Pro Cys Cys Glu Lys Glu His Met Asn Lys Gly Ala 
1 5 10 15 

Trp Thr Lys Glu Glu Asp Glu Arg Leu Val Ser Tyr lie Lys Ser His 
20 25 30 

Gly Glu Gly Cys Trp Arg Ser Leu Pro Arg Ala Ala Gly Leu Leu Arg 
35 40 45 

Cys Gly Lys Ser Cys Arg Leu Arg Trp lie Asn Tyr Leu Arg Pro Asp 
50 55 60 

Leu Lys Arg Gly Asn Phe Thr His Asp Glu Asp Glu Leu lie lie Lys 
65 70 75 80 

Leu His Ser Leu Leu Gly Asn Lys Trp Ser Leu lie Ala Ala Arg Leu 
85 90 95 



Pro Gly Arg Thr Asp Asn Glu lie Lys Asn Tyr Trp Asn Thr His lie 
100 105 110 

Lys Arg Lys Leu Leu Ser Lye Gly lie Asp Pro Ala Thr His Arg Gly 
115 120 125 

lie Asn Glu Ala Lys He Ser Asp Leu Lys Lys Thr Lys Asp Gin He 
130 135 140 

Val Lys Asp Val Ser Phe Val Thr Lys Phe Glu Glu Thr Asp Lys Ser 
145 150 155 160 

Gly Asp Gin Lys Gin Asn Lys Tyr He Arg Asn Gly Leu Val Cys Lys 
165 ' " 170 175 

Glu Glu Arg Val Val Val Glu Glu Lys He Gly Pro Asp Leu Asn Leu 
180 185 190 

Glu Leu Arg He Ser Pro Pro Trp Gin Asn Gin Arg Glu He Ser Thr 
195 200 205 

Cys Thr Ala Ser Arg Phe Tyr Met Glu Asn Asp Met Glu Cys Ser Ser 
210 ~ 215 220 

Glu Thr Val Lys Cys Gin Thr Glu Asn Ser Ser Ser He Ser Tyr Ser 
225 * 230 235 240 

Ser He Asp He Ser Ser Ser Asn Val Gly Tyr Asp Phe Leu Gly Leu 
245 250 255 

Lys Thr Arg He Leu Asp Phe Arg Ser Leu Glu Met Lys 
260 265 



<210> 47 
<211>660 
<212> DNA 

<213> Arabidopsis thaliana 
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<220> 
<221>CDS 
<222> (48). .(521) 
<223>G1113 

5 

<400> 47 

aagctttatc tctttctttc tctcctctct atctttctct cacactc atg ggt ctt 56 

Met Gly Leu 

w 1 

cca aca gat ttc aaa gag ctt cag att cca ggt tac gta eta aaa aca 104 
Pro Thr Asp Phe Lys Glu Leu Gin lie Pro Gly Tyr Val Leu Lys Thr 
5 ' " 10 15 

ctt tac gtc ate ggt ttc ttt aga gac atg gtc gat get ctt tgt cct 152 
Leu Tyr Val He Gly Phe Phe Arg Asp Met Val Asp Ala Leu Cys Pro 
20 25 30 35 

tac ate ggt eta cca agt ttt ctt gac cac aac gag acc tct cga tec 200 
Tyr He Gly Leu Pro Ser Phe Leu Asp His Asn Glu Thr Ser Arg Ser 
40 45 50 



20 



25 



35 



45 



55 



gac ccg acc cga etc get etc tec acg tea gca act ctt gee aac gag 24 8 



Asp Pro Thr Arg Leu Ala Leu Ser Thr Ser Ala Thr Leu Ala Asn Glu 
55 60 65 

tta ate ccg gtg gtt cgt ttc tec gat ctt tta acc gat ccg gaa gat 296 
Leu He Pro Val Val Arg Phe Ser Asp Leu Leu Thr Asp Pro Glu Asp 
70 75 80 

tgc tgc acg gtt tgc tta tec gat ttt gta tec gac gat aag att aga 344 
Cys Cys Thr Val Cys Leu Ser Asp Phe Val Ser Asp Asp Lys He Arg 
85 * 90 95 

cag ctg ccg aag tgt gga cac gtg ttt cat cat cgt tgt tta gac cgt 392 
Gin Leu Pro Lys Cys Gly His Val Phe His His Arg Cys Leu Asp Arg 
100 105 110 115 

tgg ate gtt gac tgt aat aag ata acg tgc ccg att tgt egg aac egg 440 
Trp He Val Asp Cys Asn Lys He Thr Cys Pro He Cys Arg Asn Arg 
120 125 130 

ttc tta ccg gag gaa aag tec acg ccg ttt gat tgg ggt act tea gat 488 
Phe Leu Pro Glu Glu Lys Ser Thr Pro Phe Asp Trp Gly Thr Ser Asp 
40 135 140 145 

tgg ttt aga gat gaa gtg gag agt acc aac taa taatgatggt tttactttta 541 
Trp Phe Arg Asp Glu Val Glu Ser Thr Asn 
150 155 



ctttttactt ttttcacggt aatatttttc tactgtataa ttctttcttc caaactactg 601 
tataattcaa gtataagatt atgtaattgt gtatattagc atcaatcatc tttctttgt 660 



<210> 48 
50 <211> 157 



<212> PRT 

<213> Arabidopsis thaliana 
<400> 48 
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Met Gly Leu Pro Thr Asp Phe Lys Glu Leu Gin lie Pro Gly Tyr Val 
15 10 15 



Leu Lys Thr Leu Tyr Val He Gly Phe Phe Arg Asp Met Val Asp Ala 
20 25 30 



Leu Cys Pro Tyr lie Gly Leu Pro Ser Phe Leu Asp His Asn Glu Thr 
35 40 45 



Ser Arg Ser Asp Pro Thr Arg Leu Ala Leu Ser Thr Ser Ala Thr Leu 
50 55 60 



Ala Asn Glu Leu He Pro Val Val Arg Phe Ser Asp Leu Leu Thr Asp 
65 70 75 80 



Pro Glu Asp Cys Cys Thr Val Cys Leu Ser Asp Phe Val Ser Asp Asp 
85 90 95 



Lys He Arg Gin Leu Pro Lys Cys Gly His Val Phe His His Arg Cys 
100 105 110 



Leu Asp Arg Trp He Val Asp Cys Asn Lys He Thr Cys Pro He Cys 
115 120 125 



Arg Asn Arg Phe Leu Pro Glu Glu Lys Ser Thr Pro Phe Asp Trp Gly 
130 135 140 



Thr Ser Asp Trp Phe Arg Asp Glu Val Glu Ser Thr Asn 
145 150 155 



<210> 49 
<211> 1201 
<212> DNA 

<213> Arabldopsis thaliana 

<220> 
<221> CDS 
<222> (148). .(996) 
<223> G398 

<400> 49 
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agaaggtttc tcttgtcctc catacactta gcacaactga taaatctttt gaggtaaaat 60 

cagctttaga tcaaggtttt tctagtcatc tctactcata aagatcaaag cttttgctat 120 

tctcattttc taccaagaga caatatc atg atg atg ggt aaa gag gat ttg ggt 174 
5 Met Met Met Gly Lys Glu Asp Leu Gly 

1 5 

tta agt ctt age ttg gga ttt gca caa aac cat cct etc cag eta aat 222 

Leu Ser Leu Ser Leu Gly Phe Ala Gin Asn His Pro Leu Gin Leu Asn 

10 15 20 25 

10 

ctt aaa ccc act tct tea cca atg tec aat etc cag atg ttt cca tgg 2 70 

Leu Lys Pro Thr Ser Ser Pro Met Ser Asn Leu Gin Met Phe Pro Trp 
30 35 40 

aac caa ace ctt gtt tct tec tea gat caa caa aag caa cag ttt ctt 318 

Asn Gin Thr Leu Val Ser Ser Ser Asp Gin Gin Lys Gin Gin Phe Leu 
15 45 50 55 

agg aaa ate gac gtg aac age ttg cca aca acg gtg gat ttg gaa gag 366 

Arg Lys lie Asp Val Asn Ser Leu Pro Thr Thr Val Asp Leu Glu Glu 
60 65 70 

20 gag aca gga gtt teg tct cca aac agt acg ate teg age aca gtg agt 414 

Glu Thr Gly Val Ser Ser Pro Asn Ser Thr lie Ser Ser Thr Val Ser 
75 * 80 85 

gga aag agg agg agt act gaa aga gaa ggt ace tec ggt ggt ggt tgc 462 

Gly Lys Arg Arg Ser Thr Glu Arg Glu Gly Thr Ser Gly Gly Gly Cys 

90 95 100 105 

25 

gga gat gac ctt gac ate act eta gat aga tct tec tea cgt gga acc 510 

Gly Asp Asp Leu Asp He Thr Leu Asp Arg Ser Ser Ser Arg Gly Thr 
110 115 120 

tec gat gaa gag gaa gat tac gga ggt gag act tgt agg aag aag ctt 55 8 

Ser Asp Glu Glu Glu Asp Tyr Gly Gly Glu Thr Cys Arg Lys Lys Leu 
30 125 130 135 

aga eta tec aaa gat caa tec gca gtt etc gaa gac act ttc aaa gag 606 

Arg Leu Ser Lys Asp Gin Ser Ala Val Leu Glu Asp Thr Phe Lys Glu 
140 145 150 

35 cac aat act etc aat ccc aaa cag aag ctg get ttg get aag aag eta 654 

His Asn Thr Leu Asn Pro Lys Gin Lys Leu Ala Leu Ala Lys Lys Leu 
155 160 165 

ggt tta aca gca aga caa gtg gaa gtg tgg ttc caa aac aga aga gca 702 

Gly Leu Thr Ala Arg Gin Val Glu Val Trp Phe Gin Asn Arg Arg Ala 

170 175 180 185 

40 

agg aca aag tta aag cag acc gaa gtg gat tgc gag tat ttg aaa aga 750 

Arg Thr Lys Leu Lys Gin Thr Glu Val Asp Cys Glu Tyr Leu Lys Arg 
190 195 " 200 

tgt gtt gag aaa tta acg gaa gag aat egg egg etc gag aaa gag gca 798 

45 Cys Val Glu Lys Leu Thr Glu Glu Asn Arg Arg Leu Glu Lys Glu Ala 

205 210 215 



50 



55 
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gcg gaa eta aga gca tta aag ctt tea ccg egg ttg tat ggt cag atg 846 

Ala Glu Leu Arg Ala Leu Lys Leu Ser Pro Arg Leu Tyr Gly Gin Met 

220 225 230 

agt cca ccg ace aca ctt ttg atg tgt cca teg tgt gaa cgt gtg gec 894 

Ser Pro Pro Thr Thr Leu Leu Met Cys Pro Ser Cys Glu Arg Val Ala 

235 240 245 

gga cca tec tea tct aac cac aac cag cga tct gtc tea ttg agt cca 942 

Gly Pro Ser Ser Ser Asn His Asn Gin Arg Ser Val Ser Leu Ser Pro 

250 255 260 265 

tgg etc caa atg gee cat ggg tea acc ttt gat gtg atg cgt cct agg 990 

Trp Leu Gin Met Ala His Gly Ser Thr Phe Asp Val Met Arg Pro Arg 

270 275 280 

tct taa etttaatget gcttctatgg gttgtgtgtg ggtcattgta ctttttagat 1046 
15 Ser 

tattgactct cagctaatgt atccttaaaa gectttttet acttttaaat ttactttaat 1106 

ctaattaaat tagttgtcca tgtcttcttg ataacaaaaa aatttataat tataaaaaaa 1166 

20 aaaaacagga taaaaaaaaa aaaaaaaaaa aaaaa 1201 



<210> 50 
<211>282 
25 <212>PRT 

<213> Arabidopsis thaliana 

<400> 50 



35 



40 



45 



50 



55 
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Met Met Met Gly Lys Glu Asp Leu Gly Leu Ser Leu Ser Leu Gly Phe 
15 10 15 

Ala Gin Asn His Pro Leu Gin Leu Asn Leu Lys Pro Thr Ser Ser Pro 
20 25 30 

Met Ser Asn Leu Gin Met Phe Pro Trp Asn Gin Thr Leu Val Ser Ser 
35 40 ~ 45 

Ser Asp Gin Gin Lys Gin Gin Phe Leu Arg Lys lie Asp Val Asn Ser 
50 55 60 

Leu Pro Thr Thr Val Asp Leu Glu Glu Glu Thr Gly Val Ser Ser Pro 
65 70 75 80 

Asn Ser Thr lie Ser Ser Thr Val Ser Gly Lys Arg Arg Ser Thr Glu 
85 90 95 

Arg Glu Gly Thr Ser Gly Gly Gly Cys Gly Asp Asp Leu Asp lie Thr 
100 * 105 * 110 

Leu Asp Arg Ser Ser Ser Arg Gly Thr Ser Asp Glu Glu Glu Asp Tyr 
115 120 125 

Gly Gly Glu Thr Cys Arg Lys Lys Leu Arg Leu Ser Lys Asp Gin Ser 
130 135 140 

Ala Val Leu Glu Asp Thr Phe Lys Glu His Asn Thr Leu Asn Pro Lys 
145 150 155 160 



Gin Lys Leu Ala Leu Ala Lys Lys Leu Gly Leu Thr Ala Arg Gin Val 
165 170 175 

Glu Val Trp Phe Gin Asn Arg Arg Ala Arg Thr Lys Leu Lys Gin Thr 
180 185 ^ 190 

Glu Val Asp Cys Glu Tyr Leu Lys Arg Cys Val Glu Lys Leu Thr Glu 
195 200 205 

Glu Asn Arg Arg Leu Glu Lys Glu Ala Ala Glu Leu Arg Ala Leu Lys 
210 " ~ 215 220 

Leu Ser Pro Arg Leu Tyr Gly Gin Met Ser Pro Pro Thr Thr Leu Leu 
225 230 235 240 

Met Cys Pro Ser Cys Glu Arg Val Ala Gly Pro Ser Ser Ser Asn His 
245 ~ 250 255 

Asn Gin Arg Ser Val Ser Leu Ser Pro Trp Leu Gin Met Ala His Gly 
260 265 270 

Ser Thr Phe Asp Val Met Arg Pro Arg Ser 

275 280 
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<210>51 
<211>937 
<212> DNA 

<213> Arabidopsis thaliana 

5 

<220> 
<221> CDS 
<222> (120). .(797) 
<223> G395 

10 

<400>51 

gcacaatccc actatccttc gcaagaccct tcctctatat aaggaagttc atttcatttg 60 

15 gagaggacac gctgacaagc tgactctagc agatctggta ccgtcgacaa ggaggaaga 119 

atg ccc tta gga gca get acg gtt gtg gag gag gaa gag gag gag gag 167 
Met Pro Leu Gly Ala Ala Thr Val Val Glu Glu Glu Glu Glu Glu Glu 
15 10 15 

gaa gcg gtg cct agt atg tea gta teg ccg ccg gat agt gta acg teg 215 
Glu Ala Val Pro Ser Met Ser Val Ser Pro Pro Asp Ser Val Thr Ser 
20 25 30 

teg ttt caa ttg gac ttt ggg att aaa agt tat ggt tat gag aga aga 263 
Ser Phe Gin Leu Asp Phe Gly lie Lys Ser Tyr Gly Tyr Glu Arg Arg 
35 40 45 

age aat aag aga gat att gat gat gaa gtg gag aga tea gee tea aga 311 
Ser Asn Lys Arg Asp lie Asp Asp Glu Val Glu Arg Ser Ala Ser Arg 
50 55 60 

gee age aac gaa gac aac gat gac gag aat gga tec act agg aag aaa 359 
qn Ala Ser Asn Glu Asp Asn Asp Asp Glu Asn Gly Ser Thr Arg Lys Lys 

65 70 75 80 

ctt aga etc tec aaa gac caa tct get ttt ctt gaa gac age ttc aaa 407 
Leu Arg Leu Ser Lys Asp Gin Ser Ala Phe Leu Glu Asp Ser Phe Lys 
85 90 95 

35 gaa cac agt ace ctt aat cct aaa cag aag att gca ttg gcg aag cag 455 

Glu His Ser Thr Leu Asn Pro Lys Gin Lys lie Ala Leu Ala Lys Gin 
100 105 110 



20 



25 



40 



45 



50 



55 
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ttg aat ctt cgt cct cgt cag gtt gaa gtc tgg ttt caa aac aga cga 
Leu Asn Leu Arg Pro Arg Gin Val Glu Val Trp Phe Gin Asn Arg Arg 
115 120 125 

gcc agg aca aag ctg aag caa acg gaa gtg gac tgt gaa tac eta aag 
Ala Arg Thr Lys Leu Lye Gin Thr Glu Val Asp Cys Glu Tyr Leu Lys 
130 135 140 

aga tgc tgt gag tea eta ace gaa gaa aac egg agg ctt caa aaa gag 
Arg Cys Cys Glu Ser Leu Thr Glu Glu Asn Arg Arg Leu Gin Lys Glu 
145 150 155 160 

gtt aaa gaa ttg aga acc ttg aag act tec aca ccc ttt tac atg caa 
Val Lys Glu Leu Arg Thr Leu Lys Thr Ser Thr Pro Phe Tyr Met Gin 
165 170 175 

ctt ccg gcc act act etc act atg tgc cct tct tgt gaa cgt gtt gcc 
Leu Pro Ala Thr Thr Leu Thr Met Cys Pro Ser Cys Glu Arg Val Ala 
180 185 190 

act tea gca gca cag ccc tec acg tea get gcc cac aac etc tgt ttg 
Thr Ser Ala Ala Gin Pro Ser Thr Ser Ala Ala His Asn Leu Cys Leu 
195 200 205 

tec acg tea tea ttg att ccg gtt aag cct egg ccg gcc aaa caa gtt 
Ser Thr Ser Ser Leu He Pro Val Lys Pro Arg Pro Ala Lys Gin Val 
210 215 220 

tea tga aagcacctgc gaaatacagt ttgagcaaac gggcggccgc tctagacagg 

Ser 

225 

cctcgtaccg gatcctctag ctagagcttt cgttcgtatc ateggttteg acaaegtteg 
tcaagttcaa tgacatcagt ttgattgege 



<210> 52 
<211>225 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 52 
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Met Pro Leu Gly Ala Ala Thr Val Val Glu Glu Glu Glu Glu Glu Glu 
15 10 15 

Glu Ala Val Pro Ser Met Ser Val Ser Pro Pro Asp Ser Val Thr Ser 
20 25 30 

Ser Phe Gin Leu Asp Phe Gly He Lys Ser Tyr Gly Tyr Glu Arg Arg 
35 40 45 

Ser Asn Lys Arg Asp He Asp Asp Glu Val Glu Arg Ser Ala Ser Arg 
50 55 60 

Ala Ser Asn Glu Asp Asn Asp Asp Glu Asn Gly Ser Thr Arg Lys Lys 
65 70 75 80 

Leu Arg Leu Ser Lys Asp Gin Ser Ala Phe Leu Glu Asp Ser Phe Lys 
B5 90 95 

Glu His Ser Thr Leu Asn Pro Lys Gin Lys He Ala Leu Ala Lys Gin 
100 105 110 

Leu Asn Leu Arg Pro Arg Gin Val Glu Val Trp Phe Gin Asn Arg Arg 
115 120 125 



Ala Arg Thr Lys Leu Lys Gin Thr Glu Val Asp Cys Glu Tyr Leu Lys 
130 * 135 140 

Arg Cys Cys Glu Ser Leu Thr Glu Glu Asn Arg Arg Leu Gin Lys Glu 
145 150 155 160 

Val Lys Glu Leu Arg Thr Leu Lys Thr Ser Thr Pro Phe Tyr Met Gin 
165 170 175 

Leu Pro Ala Thr Thr Leu Thr Met Cys Pro Ser Cys Glu Arg Val Ala 
180 185 190 

Thr Ser Ala Ala Gin Pro Ser Thr Ser Ala Ala His Asn Leu Cys Leu 
195 200 205 

Ser Thr Ser Ser Leu He Pro Val Lys Pro Arg Pro Ala Lys Gin Val 
210 215 220 

Ser 
225 



<210> 53 
<211>927 
<212> DNA 

<213> Arabidopsis thaliana 

<220> 
<221> CDS 
<222> (37).. (861) 
<223> G393 
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<400> 53 

aaattttaaa ggcatatttt tttcatttcg actcag atg ggt ttt gat gat aca 54 

Met Gly Phe Asp Asp Thr 
1 5 

tgc aac aca ggt ctt gtt ctt gga tta ggt ccc tea cca att tea aat 102 
Cys Asn Thr Gly Leu Val Leu Gly Leu Gly Pro Ser Pro lie Ser Asn 
10 15 20 

aat tac aat agt acc ate aga caa tec tec gtt tac aag etc gag ccg 150 
Asn Tyr Asn Ser Thr lie Arg Gin Ser Ser Val Tyr Lys Leu Glu Pro 
25 30 35 

teg ttg act eta tgc etc teg ggc gat ccc teg gtt acc gtg gtg acc 198 

Ser Leu Thr Leu Cys Leu Ser Gly Asp Pro Ser Val Thr Val Val Thr 
40 45 50 

15 

gga get gac cag eta tgc cgt cag acg tea tct cac age gga gtc tct 246 

Gly Ala Asp Gin Leu Cys Arg Gin Thr Ser Ser His Ser Gly Val Ser 

55 60 65 70 

tct ttc tea age ggg agg gtg gtg aaa aga gag aga gac ggt ggc gaa 2 94 

Ser Phe Ser Ser Gly Arg Val Val Lys Arg Glu Arg Asp Gly Gly Glu 
20 7 5 8 0 8 5 

gag teg ccg gag gag gaa gag atg acg gag aga gtt ata agt gat tac 342 
Glu Ser Pro Glu Glu Glu Glu Met Thr Glu Arg Val lie Ser Asp Tyr 
90 95 100 

25 cat gaa gat gaa gaa ggt att agt get aga aaa aaa ctt agg ctt acg 390 

His Glu Asp Glu Glu Gly lie Ser Ala Arg Lys Lys Leu Arg Leu Thr 
105 110 4 115 

aaa caa caa tct get ctt ctt gag gaa age ttc aag gat cat age acc 438 



35 



40 



45 



50 



55 
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10 



20 



25 



50 



55 



Lys Gin Gin Ser Ala Leu Leu Glu Glu Ser Phe Lys Asp His Ser Thr 
120 125 130 

ctt aat ccc aaa caa aag caa gtt ctg get aga cag ctg aat eta agg 486 
Leu Asn Pro Lys Gin Lys Gin Val Leu Ala Arg Gin Leu Asn Leu Arg 
135 140 145 150 

cct aga caa gtt gaa gta tgg ttt caa aat aga aga gec agg aca aag 534 
Pro Arg Gin Val Glu Val Trp Phe Gin Asn Arg Arg Ala Arg Thr Lys 
155 160 165 

ctg aag caa aca gaa gta gat tgt gag ttt ttg aag aag tgt tgt gaa 582 
Leu Lys Gin Thr Glu Val Asp Cys Glu Phe Leu Lys Lys Cys Cys Glu 
170 175 180 

aca tta gca gat gag aac ata aga ctt cag aaa gag att caa gaa etc 630 
Thr Leu Ala Asp Glu Asn lie Arg Leu Gin Lys Glu lie Gin Glu Leu 
185 190 195 

aaa acc eta aaa ttg act cag ccc ttt tac atg cac atg cct gca teg 678 
Lys Thr Leu Lys Leu Thr Gin Pro Phe Tyr Met His Met Pro Ala Ser 
200 205 210 

act eta acg aag tgt cct tct tgt gag aga ate ggc ggc ggc ggc ggg 726 
Thr Leu Thr Lys Cys Pro Ser Cys Glu Arg lie Gly Gly Gly Gly Gly 
215 220 225 230 

ggt aat gga gga gga ggt ggc ggc age ggg get acc gcg gtg att gta 774 
Gly Asn Gly Gly Gly Gly Gly Gly Ser Gly Ala Thr Ala Val He Val 
235 240 245 

gat gga agt acg gee aaa gga get ttc tct ate tec tea aag cct cac 822 
Asp Gly Ser Thr Ala Lys Gly Ala Phe Ser He Ser Ser Lys Pro His 
250 " 255 260 

ttc ttc aac cct ttt act aac cca tct gca get tgt tga atagttaatt 871 
Phe Phe Asn Pro Phe Thr Asn Pro Ser Ala Ala Cys 
265 270 

cgtttaattt tattacttaa aatattaatt ttcttttttt ttttgggtgg catttt 927 



35 <210>54 
<211>274 
<212> PRT 

<213> Arabidopsis thaliana 
40 <400> 54 
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Met Gly Phe Asp Asp Thr Cys Asn Thr Gly Leu Val Leu Gly Leu Gly 
15 10 15 

Pro Ser Pro lie Ser Asn Asn Tyr Asn Ser Thr lie Arg Gin Ser Ser 
20 25 30 

Val Tyr Lys Leu Glu Pro Ser Leu Thr Leu Cys Leu Ser Gly Asp Pro 
35 40 45 

Ser Val Thr Val Val Thr Gly Ala Asp Gin Leu Cys Arg Gin Thr Ser 
50 55 60 

Ser His Ser Gly Val Ser Ser Phe Ser Ser Gly Arg Val Val Lys Arg 
65 70 75 80 

Glu Arg Asp Gly Gly Glu Glu Ser Pro Glu Glu Glu Glu Met Thr Glu 
85 90 95 

Arg Val lie Ser Asp Tyr His Glu Asp Glu Glu Gly lie Ser Ala Arg 
100 105 110 



Lys Lys Leu Arg Leu Thr Lys Gin Gin Ser Ala Leu Leu Glu Glu Ser 
115 120 125 

Phe Lys Asp His Ser Thr Leu Asn Pro Lys Gin Lys Gin Val Leu Ala 
130 135 140 

Arg Gin Leu Asn Leu Arg Pro Arg Gin Val Glu Val Trp Phe Gin Asn 
145 150 155 160 

Arg Arg Ala Arg Thr Lys Leu Lys Gin Thr Glu Val Asp Cys Glu Phe 
165 170 175 

Leu Lys Lys Cys Cys Glu Thr Leu Ala Asp Glu Asn lie Arg Leu Gin 
180 185 190 

Lys Glu lie Gin Glu Leu Lys Thr Leu Lys Leu Thr Gin Pro Phe Tyr 
195 200 205 

Met His Met Pro Ala Ser Thr Leu Thr Lys Cys Pro Ser Cys Glu Arg 
210 215 * ^ 220 

lie Gly Gly Gly Gly Gly Gly Asn Gly Gly Gly Gly Gly Gly Ser Gly 
225 230 235 240 

Ala Thr Ala Val lie Val Asp Gly Ser Thr Ala Lys Gly Ala Phe Ser 
245 250 255 

lie Ser Ser Lys Pro His Phe Phe Asn Pro Phe Thr Asn Pro Ser Ala 
260 265 270 



Ala Cys 



106 



EP 1 230 345 B1 



Claims 

1 . A transgenic plant having greater tolerance to an environmental stress than a wild-type plant of the same species, 
wherein the environmental stress is selected from the group consisting of drought, salt, heat and cold, 

wherein the transgenic plant comprises an expression vector or cassette, and wherein the expression vector or 
cassette comprises a polynucleotide having a sequence encoding a polypeptide having a conserved domain with 
at least 80% sequence identity to the conserved domain of amino acid residues 25 to 116 of SEQ ID NQ:16, 
and wherein overexpression of the polypeptide in the transgenic plant increases the tolerance of the transgenic 
plant to the environmental stress as compared to the tolerance of the wild-type plant to drought, salt, heat or cold. 

2. The transgenic plant of claim 1 , wherein the polynucleotide has a sequence comprising SEQ ID NO:15. 

3. The transgenic plant of any one of claims 1 or 2, wherein the expression vector or cassette further comprises a 
constitutive, inducible or tissue-specific promoter operably linked to the polynucleotide having said sequence. 

4. The transgenic plant of any one of claims 1 to 3, wherein the transgenic plant has greater tolerance to salt than the 
wild-type plant. 

5. The transgenic plant of any one of claims 1 to 4, wherein the plant is selected from the group consisting of soybean, 
wheat, corn, potato, cotton, rice, oilseed rape, sunflower, alfalfa, sugarcane, turf, banana, blackberry, blueberry, 
strawberry, raspberry, cantaloupe, carrot, cauliflower, coffee, cucumber, eggplant, grapes, honeydew, lettuce, man- 
go, melon, onion, papaya, peas, peppers, pineapple, spinach, squash, sweet corn, tobacco, tomato, watermelon, 
rosaceous fruits and vegetable brassicas. 

6. The transgenic plant of any one of claims 1 to 5, wherein the transgenic plant is more tolerant than the wild type 
plant to a six-hour exposure to 200 mM NaCI. 

7. A seed obtainable from the transgenic plant of any of claims 1 to 6, wherein the seed comprises the expression 
vector or cassette comprising the polynucleotide having said sequence. 

8. Use of an expression vector or cassette to produce a transformed plant, wherein the expression vector or cassette 
encodes a polypeptide having a conserved domain with at least 80% sequence identity to the conserved domain 
of amino acid residues 25 to 116 of SEQ ID NO: 16, and overexpression of the polypeptide in the transformed plant 
increases tolerance of the transformed plant to heat, drought, salt and/or cold as compared to the tolerance of the 
wild-type plant. 

9. Use of a polynucleotide to produce a transformed plant, wherein the polynucleotide encodes a polypeptide having 
a conserved domain with at least 80% sequence identity to a conserved domain of amino acid residues 25 to 116 
of SEQ ID NO:16, the polynucleotide is incorporated into an expression vector or cassette, the expression vector 
or cassette is introduced into a plant to produce the transformed plant, and the transformed plant has increased 
tolerance to heat, drought, salt and/or cold as compared to the tolerance of the wild type plant. 

1 0. A method for producing a plant having increased tolerance to heat, drought, salt and/or cold, the method comprising 
the steps of: 

(a) providing an expression vector or cassette comprising an isolated or recombinant polynucleotide encoding 
a polypeptide that comprises a conserved domain with at least 80% sequence identity to a conserved domain 
of amino acid residues 25 to 1 16 of SEQ ID NO: 16; and 

(b) introducing the expression vector or cassette into the plant to produce a transformed plant, wherein 

overexpression of the polypeptide in the transformed plant increases tolerance of the transformed plant to heat, 
drought, salt and/or cold as compared to the tolerance of a wild-type plant to drought, salt, heat and/or cold. 

1 1 . The method of claim 1 0, wherein the transformed plant is more tolerant than the wild type plant to a six-hou r exposure 
to 200 mM NaCI. 

1 2. The method according to claim 1 0 or 1 1 , wherein the expression vector or cassette comprises a constitutive, inducible 
or tissue-specific promoter. 
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Patentanspruche 

1 . Transgene Pf lanze mit ho hererToleranz gegenuber Umweltbelastungen als eine Wildtyppflanze derselben Spezies, 
worin die Umweltbelastung aus der aus Trockenheit, Salz, Hitze und Kalte bestehenden Gruppe ausgewahlt ist, 

5 worin die transgene Pflanze einen Expressionsvektor oder eine Expressionskassette umfasst und worin der Ex- 

pressionsvektor oder die Expressionskassette ein Polynucleotid mit einer Sequenz umfasst, die fur ein Polypeptid 
kodiert, das eine konservierte Domane mit zumindest 80 % Sequenzidentitat mit der konservierten Domane der 
Aminosaurereste 25 bis 116 der Seq.-ID Nr. 1 6 aufweist, 

und worin eine Uberexpression des Polypeptids in der transgenen Pflanze die Toleranz der transgenen Pflanze 
10 gegenuber Umweltbelastungen im Vergleich zur Toleranz der Wildtyppflanze gegenuber Trockenheit, Salz, Hitze 

oder Kalte steigert. 

2. Transgene Pflanze nach Anspruch 1, worin das Polynucleotid eine Sequenz aufweist, die Seq.-ID Nr. 15 umfasst. 

is 3. Transgene Pflanze nach einem der Anspruche 1 bis 2, worin der Expressionsvektor oder die Expressionskassette 
weiteres einen konstitutiven, induzierbaren oder gewebespezifischen Promotor umfasst, der operabel an das Po- 
lynucleotid mit der genannten Sequenz gebunden ist. 

4. Transgene Pflanze nach einem der Anspruche 1 bis 3, worin die transgene Pflanze hohere Toleranz gegenuber 
20 Salz aufweist als die Wildtyppflanze. 

5. Transgene Pflanze nach einem der Anspruche 1 bis 4, worin die Pflanze aus der aus Sojabohne, Weizen, Mais, 
Kartoffel, Baumwolle, Reis, Raps, Sonnenblume, Luzerne, Zuckerrohr, Rasen, Banane, Brombeere, Heidelbeere, 
Erdbeere, Himbeere, Cantaloupe-Melone, Karotte, Karfiol, Kaffee, Gurke, Melanzani, Weintrauben, Honigmelone, 

25 Gartensalat, Mango, Zuckermelone, Zwiebel, Papaya, Erbsen, Paprika, Ananas, Spinat, Kurbis, SuBmais, Tabak, 

Tomate, Wassermelone, Rosengewachs-Obstbaume und Gemusekohlsorten bestehenden Gruppe ausgewahlt ist. 

6. Transgene Pflanze nach einem der Anspruche 1 bis 5, worin die transgene Pflanze gegenuber einer sechsstundigen 
Aussetzung gegenuber 200 mM NaCI toleranter ist als die Wildtyppflanze. 

30 

7. Samen, der von einer transgenen Pflanze nach einem der Anspruche 1 bis 6 erhaltlich ist, worin der Samen den 
Expressionsvektor oder die Expressionskassette umfasst, der/die das Polynucleotid mit der genannten Sequenz 
umfasst. 

35 8. Verwendung eines Expressionsvektors oder einer Expressionskassette, urn eine transformierte Pflanze herzustel- 
len, worin der Expressionsvektor oder die Expressionskassette fur ein Polypeptid kodiert, das eine konservierte 
Domane mit zumindest 80 % Sequenzidentitat mit der konservierten Domane der Aminosaurereste 25 bis 1 1 6 der 
Seq.-ID Nr. 16 aufweist, und eine Uberexpression des Polypeptids in dertransformierten Pflanze die Toleranz der 
transformierten Pflanze gegenuber Hitze, Trockenheit, Salz und/oder Kalte im Vergleich zur Toleranz der Wild- 

40 typpflanze steigert. 

9. Verwendung eines Polynucleotids zur Herstellung einer transformierten Pflanze, worin das Polynucleotid fur ein 
Polypeptid kodiert, das eine konservierte Domane mit zumindest 80 % Sequenzidentitat mit der konservierten 
Domane der Aminosaurereste 25 bis 1 1 6 der Seq.-ID Nr. 1 6 aufweist, das Polynucleotid in einen Expressionsvektor 

45 oder eine Expressionskassette inkorporiert ist, der Expressionsvektor oder die Expressionskassette in eine Pflanze 

eingefuhrtwird, um die transformierte Pflanze herzustellen, und die transformierte Pflanze im Vergleich zurToleranz 
der Wildtyppflanze gesteigerte Toleranz gegenuber Hitze, Trockenheit, Salz und/oder Kalte aufweist. 

10. Verfahren zur Herstellung einer Pflanze mit gesteigerter Toleranz gegenuber Hitze, Trockenheit, Salz und/oder 
50 Kalte, wobei das Verfahren folgenden Schritte umfasst: 

(a) das Bereitstellen eines Expressionsvektors oder einer Expressionskassette, der/die ein isoliertes oder re- 
kombinantes Polynucleotid umfasst, das fur ein Polypeptid kodiert, welches eine konservierte Domane mit 
zumindest 80 % Sequenzidentitat mit der konservierten Domane der Aminosaurereste 25 bis 1 1 6 von Seq.-ID 

55 Nr. 16 aufweist; und 

(b) das Einfuhren des Expressionsvektors oder der Expressionskassette in die Pflanze, um eine transformierte 
Pflanze herzustellen, worin 
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eine Uberexpression des Polypeptids in der transformierten Pflanze die Toleranz der transformierten Pflanze ge- 
genuber Trockenheit, Salz, Hitze und/oder Kalte im Vergleich zur Toleranz der Wildtyppflanze Trockenheit, Salz, 
Hitze und/oder Kalte steigert. 

11. Verfahren nach Anspruch 10, worin die transformierte Pflanze gegenuber einer sechsstundigen Aussetzung ge- 
genuber 200 mM NaCI toleranter ist als die Wildtyppflanze. 

12. Verfahren nach Anspruch 10oder 1 1 , worin der Expressionsvektoroderdie Expressionskassette einen konstitutiven, 
induzierbaren oder gewebespezifischen Promoter umfasst. 



Revendications 

1. Plante transgenique ayant une plus grande tolerance vis-a-vis d'un stress environnemental qu'une plante du type 
sauvage de la meme espece, 

ou le stress environnemental est selectionne dans le groupe consistant en secheresse, sel, chaleur et froid, 
ou la plante transgenique comprend un vecteur d'expression ou cassette et ou le vecteur d'expression ou cassette 
comprend un polynucleotide ayant une sequence codant pour un polypeptide ayant un domaine conserve avec au 
moins 80% d'identite de sequence avec le domaine conserve de residus d'acides amines 25 a 1 1 6 de SEQ ID NO:1 6; 
et ou la surexpression du polypeptide dans la plante transgenique augmente la tolerance de la plante transgenique 
au stress environnemental en comparaison avec latolerance de la plante du type sauvage vis-a-vis de la secheresse, 
du sel, de la chaleur ou du froid. 

2. Plante transgenique de la revendication 1 , ou le polynucleotide a une sequence comprenant SEQ ID NO:1 5. 

3. Plante transgenique de Tune quelconque des revendications 1 ou 2, ou le vecteurd'expression ou cassette comprend 
de plus un promoteurconstitutif inductible specifique du tissu operativement enchatne au polynucleotide ayant ladite 
sequence. 

4. Plante transgenique de Tune quelconque des revendications 1 a 3, ou la plante transgenique a une plus grande 
tolerance vis-a-vis d'un sel que la plante du type sauvage. 

5. Plante transgenique de Tune quelconque des revendications 1 a 4, ou la plante est selectionnee dans le groupe 
consistant en soja, ble, maTs, pomme de terre, coton, riz, colza, tournesol, luzerne, canne asucre, gazon, banane, 
mure, myrtille, fraise, framboise, cantaloup, carotte, choux-fleur, cafe, concombre, aubergine, raisin, honeydew, 
laitue, mangue, melon, oignon, papaye, pois, poivre, ananas, epinard, patisson, ma!s doux, tabac, tomate, melon 
d'eau, fruits rosaces, brasicas vegetaux. 

6. Plante transgenique de Tune quelconque des revendications 1 a 5, ou la plante transgenique est plus tolerante que 
la plante du type sauvage vis-a-vis d'une exposition de six heures a NaCI 200 nM. 

7. Graine obtenue a partir de la plante transgenique de Tune quelconque des revendications 1 a 6, ou la graine 
comprend le vecteur d'expression ou cassette comprenant le polynucleotide ayant ladite sequence. 

8. Utilisation d'un vecteur d'expression ou d'une cassette pour produire une plante transformee ou le vecteur d'ex- 
pression ou cassette code pour un polypeptide ayant un domaine conserve avec au moins 80% d'identite de se- 
quence avec le domaine conserve des residus d'acides amines 25 a 1 16 de SEQ ID NO:1 6 et la surexpression du 
polypeptide dans la plante transformee augmente la tolerance de la plante transformee a la chaleur, a la secheresse, 
au sel et/ou au froid en comparaison avec la tolerance du type sauvage. 

9. Utilisation d'un polynucleotide pour produire une plante transformee ou le polynucleotide code pour un polypeptide 
ayant un domaine conserve avec au moins 80% d'identite de sequence avec un domaine conserve de residus 
d'acides amines 25 a 116 de SEQ ID NO:16, le polynucleotide est incorpore dans un vecteur d'expression ou 
cassette, le vecteur d'expression ou cassette est introduit dans une plante pour produire la plante transformee et 
la plante transformee a une tolerance accrue a la chaleur, a la secheresse, au sel et/ou au froid en comparaison 
avec la tolerance de la plante du type sauvage. 

10. Methode de production d'une plante ayant une tolerance accrue a la chaleur, a la secheresse, au sel et/ou au froid, 
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la methode comprenant les etapes de : 

(a) prevoir un vecteur d'expression ou cassette comprenant un polynucleotide isole ou recombinant codant 
pour un polypeptide qui comprend un domaine conserve avec au moins 80% d'identite de sequence avec un 

5 domaine conserve des residus d'acides amines 25 a 11 6 de SEQ ID NO:1 6 ; et 

(b) introduire le vecteur d'expression ou cassette dans la plante pour produire une plante transformee, ou 

la surexpression du polypeptide dans la plante transformee augmente la tolerance de la plante transformee a la 
chaleur, la secheresse, le sel et/ou le froid en comparaison avec la tolerance d'une plante du type sauvage a la 
10 secheresse, le sel, la chaleur et/ou le froid. 

11. Methode de la revendication 10, ou la plante transformee est plus tolerance que la plante du type sauvage a une 
exposition de six heures a NaCI 200 nM. 

is 1 2. Methode selon la revendication 1 0 ou 11 , ou le vecteur d'expression ou cassette comprend un promoteur constitutif 
inductible specifique du tissu. 

20 
25 
30 
35 
40 
45 
50 
55 
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Figure 1 



SEQ ID No. 


GID 


cDNA or protein 


conserved domain 




G22 


cDNA 




2 


G22 


protein 


89-157 


3 


G188 


cDNA 




4 


G188 


protein 


175-222 


! 5 


G225 


cDNA 




I 6 


G225 


protein 


39-76 


7 


G226 


cDNA 




; 8 


G226 


protein 


28-78 


9 


G256 


cDNA 




10 


G256 


protein 


13-115 


11 


G419 


cDNA 




12 


G419 


protein 


392-452 


13 


G464 


cDNA 




I 14 


G464 


protein 


7-1 5,70-80, 1 25-1 58, 1 83-2 1 9 


I 15 


G482 


cDNA 




i 16 


G482 


protein 


25-116 


i 17 


G502 


cDNA 




18 


G502 


protein 


10-155 


19 


G526 


cDNA 




! 20 


G526 


protein 


21-149 


i 21 


G545 


cDNA 




! 22 


G545 


protein 


82-102, 136-154 


S 23 


G561 


cDNA 




I 24 


G561 


protein 


248-308 


25 


G664 


cDNA 




26 


G664 


protein 


13-116 


27 


G682 


cDNA 




28 


G682 


protein 


22-53 


29 


G911 


cDNA 




30 


G911 


protein 


86-129 


! 31 


G964 


cDNA 




I 32 


G964 


protein 


126-186 


! 33 


G394 


cDNA 




! 34 


G394 


protein 


121-182 


| 35 


G489 


cDNA 




36 


G489 


protein 


57-156 



111 



EP 1 230 345 B1 



Figure 2 



SEQ ID No. 


GID 


homolog^ 


cDNA or protein 


conserved domain 


37 


G463 


homolog of G464 


cDNA 




38 


G463 


homolog of G464 


protein 


14-23, 77-88, 130-146, 194-227 


39 


G767 


homolog of G502 


cDNA 




40 


G767 


homolog of G502 


protein 


8-158 


41 


G765 


homolog of G526 


cDNA 




42 


G765 


homolog of G526 


protein 


23-167 


43 


G197 


homolog of G664 


cDNA 




44 


G197 


homolog of G664 


protein 


14-119 


45 


G255 


homolog of G664 


cDNA 




46 


G255 


homolog of G664 


protein 


14-115 


47 


G1113 


homolog of G911 


cDNA 




48 


G1113 


homolog of G911 


protein 


85-128 


49 


G398 


homolog of G964 


cDNA 




50 


G398 


homolog of G964 


protein 


128-191 


51 


G395 


homolog of G394 


cDNA 




52 


G395 


homolog of G394 


protein 


72-135 


53 


G393 


homolog of G394 


cDNA 




54 


G393 


homolog of G394 


protein 


106-169 
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Figure 3A 



ccn in Ma 


GID 


Genbank NID 


l vaiuc 


Species 


1 




TQAOCG 

f yujoy 


1 .UUfc-40 


iNicouana laoacurn 


1 


poo 


QO/lOO^ A 

oo4ZZ1U 


b.bUt-4o 


Lycopersicon escuienium 


1 


poo 


OOu4/ f D 


1 .bUfc-44 


ti A n*4 1 r**-< /-1 y\ tri i A A£Stl it 9 

JvieUlCatjO llUl lOdlllla 


1 


POO 

G2Z 


68G9570 


C OAC /I > 

5.80E-44 


iNi co iiana syivesins 


1 


POO 

G2Z 


■7/507AC i 


0 vi nr 'So 
Z.4Ut-oy ! 


oossypium aruoreum 


1 


POO 

(322 


/o^44f y 


ft CAC O/S 


1 \//«Ar\Aro i/^An pk »^ e\ 1 1 1 1 

Lycopersicon penneiui 


- . 1 


G22 


QADAO 4 O 

89B0312 


A OAC 04 

4.30E-31 


uatnarantnus roseus 


1 


G22 


7528275 


A AAC OA 

1 .20E-30 


MesenriDryantnemurn crystal i mum 


1 


POO 

G22 


G A "7QQAA 

04/0044 


a cine op 
4.bUt-^o 


Mairicana cnamorniiia 


1 - 


G22 


6847348 


C ft AC OC 

5.90E-26 


Glycine max 


3 


G188 


7779802 


e O ACT 0£> 

5.20E-36 


Lotus japonicus 


3 


G188 


7284340 


2.10E-34 


Glycine max 


3 


G188 


9361307 


1 .20E-27 


Triticum aestivum 


3 


G188 


7340336 


1.10E-22 


Oryza sativa 


3 


G188 


6529152 


O ffAC ft A 

3.60E-22 


Lycopersicon esculentum 


3 


G188 


0T>I0^77 

8748477 


-T -T/-\f— 04 

7.70E-21 


Medicago truncatula 


3 


G188 


5456433 


7 4 AC 4 > 

7.10E-14 


Zea mays 


3 


G188 


9302479 


1 .bOE-lz 


Sorghum bicolor 


3 


G188 


6696287 


A A AC 4 O 

4.10E-12 


Pinus taeda 


3 


G188 


562242 


A AAC A O 

9.00E-12 


Brass ica rapa 


5 


G225 


4396287 


4.40E-16 


Glycine max 


5 


G225 


309571 


S\ rtAAAA 

0.00029 


Zea mays 


5 


G225 


3857004 


0.001 


Populus tremula x Populus tremuloides 


5 


G225 


A A A rtOfl C 

9410205 


A A A A 

0.019 


Triticum aestivum 


5 


G225 


9426190 


0.025 


Triticum turgidum subsp. durum 


5 


G225 


838211 8 


A f\AC 

0.046 


Gossypium arboreum 


5 


G225 


6782756 


0.27 


Oryza sativa 


5 


G225 


7721017 


0.4 


Lotus japonicus 


5 


G225 


6020136 


0.47 


Pinus taeda 


5 


G225 


2921331 


0.48 


Gossypium hirsutum 


7 


G226 


4396287 


r- a AC 4 C 

5.10E-15 


Glycine max 


7 


G226 


9410205 


1.50E-05 


Triticum aestivum 


7 


G226 


3857004 


0.1 1 


Populus tremula x Populus tremuloides 


7 


G226 


Pi i n A i #^ p* 

2428139 


0.35 


Oryza sativa 


9 


G256 


1430847 


1 .30E-72 


Lycopersicon esculentum 


9 


G256 


nnr/l A A 4 

9252441 


a OAC CC 

1 -20E-DO 


Solarium tuberosum 


9 


G256 


8380712 


ft OAC CO 

2.20E-58 


Gossypium arboreum 


9 


G256 


O 4 "7 A AT 

8172976 


a r? AC IT yi 

1 .60E-54 


Medicago truncatula 


9 


G256 


9205295 


1 .oOh-44 


Glycine max 


9 


G256 


20562 


d A AC /I A 

b.4Ut-4U 


Petunia x hybrida 


i 9 


G256 


48B6263 


A A AC 07 

4.4Ufc-o7 


Antirrhinum majus 


9 


G256 


6552360 


C AAC O^i 


Nicotiana tabacum 


9 


G256 


2312003 


4 OAC O C 

1 .20E-35 


Oryza sativa 


9 


G256 


5268628 


c oac on 


Zea mays 


A A 
I 1 




700Q1CR 


i.OUL JC7 


IVlalUO A Wvll IICOllvO 


11 


G419 


5278451 


9.00E-58 


Lycopersicon esculentum 


11 


G419 


9205496 


1 .30E-55 


Glycine max 


11 


G419 


7628137 


9.30E-51 


Gossypium arboreum 


11 


G419 


6069643 


9.50E-51 


Oryza sativa 


I 11 


G419 


7562931 


9.80E-45 


Medicago truncatula 


11 


G419 


7322293 


2-30E-37 


Lycopersicon hirsutum 


I 11 


G419 


8404716 


1.10E-29 


Hordeum vulgare 


11 


G419 


7217755 


1.40E-29 


Sorghum bicolor 
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Figure 3B 



SEQ ID No. 


GID 


Genbank NID 


P- value 


Species 


11 


G419 


9428023 


4.60E-28 


Triticum aestivum 


13 


G464 


6527230 


3.60E-31 


Lycopersicon esculentum 


13 


G464 


9305572 


1.10E-22 


Sorghum bicolor 


13 


G464 


6604917 


6.70E-22 


Medicago truncatula 


13 


G464 


5058123 


2.30E-21 


Glycine max 


13 


G464 


3760881 


1.20E-19 


Oryza sativa 


13 


G464 


5044476 


1.20E-17 


Gossypiurn hirsutum 


13 


G464 


9412603 


6.40E-15 


Triticum aestivum 


13 


G464 


7777277 


3.20E-13 


Lotus japonicus 


13 


G464 


9410371 


1.70E-11 


Hordeum vulgare 


13 


G464 


7624108 


2.10E-10 


Gossypiurn arboreum 


15 


G482 


7691987 


5.50E-50 


Glycine max 


15 


G482 


7781090 


1 .30E-48 


Lotus japonicus 


15 


G482 


7409616 


1.10E-47 


Lycopersicon esculentum 


15 


G482 


9416562 


4.40E-46 


Triticum aestivum 


15 


G482 


22379 


2.30E-44 


Zea mays 


15 


G482 


7501372 


7.70E-44 


Gossypiurn arboreum 


15 


G482 


7765436 


8.40E-42 


Medicago truncatula 


15 


G482 


5044464 


1 .20E-40 


Gossypiurn hirsutum 


15 


G482 


9441376 


9.20E-40 


Chlamydomonas reinhardtii 


15 


G482 


8071558 


3.50E-39 


Solanum tuberosum 


17 


G502 


6730941 


1 .60E-91 


Oryza sativa 


17 


G502 


7765679 


1 60E-82 


Medicago truncatula 


17 


G502 


7502501 


7.30E-80 


Gossypiurn arboreum 


17 


G502 


5510359 


8.30E-77 


Glycine max 


17 


G502 


5601137 


8.70E-76 


Lycopersicon esculentum 


17 


G502 


9302206 


1 40E-73 


Sorghum bicolor 


17 


G502 


4089948 


3.40E-50 


Brassica napus 


17 


G502 


8329134 


7.90E-49 


Mesembryanthemum crystallinum 


I 17 


G502 


7723564 


8.60E-49 


Lotus japonicus 


17 


G502 


4218534 


1.80E-48 


Triticum sp. 


19 


G526 


5049217 


3.40E-61 


Gossypiurn hirsutum 


19 


G526 | 


6066594 


1.50E-55 


Petunia x hybrida 


19 


G526 


4384535 


1.50E-54 


Lycopersicon esculentum 


19 


G526 


6454868 


6.60E-54 


Glycine max 


19 


G526 


4977542 


4.70E-52 


Oryza sativa 


19 


G526 


5343151 


7.00E-51 


Zea mays 


19 


G526 


9361647 


5.10E-50 


Triticum aestivum 


19 


G526 


6799764 


4.30E-48 


Medicago truncatula 


19 


G526 


8708684 


1 .80E-47 


Hordeum vulgare 


19 


G526 


4218536 


3.60E-47 


Triticum sp. 


21 


G545 


4666359 


8.30E-55 


Datisca glome rata 


21 


G545 


7228328 


3.70E-52 


Medicago sativa 


21 


G545 


1763062 


1.30E-51 


Glycine max 


21 


G545 


7206360 


3.10E-44 


Medicago truncatula 


21 


G545 


7626808 


9.60E-40 


Gossypiurn arboreum 


21 


G545 


439492 


3.90E-39 


Petunia x hybrida 


21 


G545 


4382658 


1.70E-38 


Lycopersicon esculentum 


21 


G545 


8486215 


8.70E-38 


Euphorbia esula 


21 


G545 


7322653 


6.80E-37 


Lycopersicon hirsutum 


21 


G545 


7785845 


1.10E-33 


Lotus japonicus 


23 


G561 


2995461 


5.60E-86 


Sinapis alba 


23 


G561 


633153 


6.50E-83 


Brassica napus 
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Figure 3C 



SEQ ID No. 


GID 


Genbank NID| P-value 


Species 


23 


G561 


1033058 


5.90E-65 


Raphanus sativus 


23 


G561 


2815304 


2.10E-35 


Spinacia oieracea 


23 


G561 


1498300 


1.60E-34 


Petroselinum crispum 


23 


I G561 


169958 


8.10E-32 


Glycine max 


| 23 


G561 


5381310 


2.20E-30 


Catharanthus roseus 


S 23 


G561 


1 1 55053 


| 9.70E-28 


Phaseolus vulgaris 


23 


G561 


728627 


1 .90E-27 


Nicotiana tabacum 


! 23 


G561 


7565950 


1.40E-21 


Medicago truncatula 


25 


G664 


1167483 


4.90E-81 


Lycqpersicon esculentum 


25 


G664 


7765706 


6.30E-69 


Medicago truncatula 


i 25 


G664 


19052 


9.30E-68 


Hordeum vulgare 


25 


G664 


7626566 


4.00E-67 


Gossypium arboreum 


25 


G664 


5050757 


2.60E-66 


Gossypium hirsutum 


25 


G664 


6650206 


6.90E-66 


Oryza sativa 


25 


G664 


6667606 


2.20E-63 


Glycine max 


25 


G664 


517492 


9.30E-62 


Zea mays 


25 


G664 


9302672 


1 .50E-59 


Sorghum bicolor 


25 


G664 


5860031 


9.20E-58 


Pinus taeda 


27 


G682 


309571 


4.40E-08 


Zea mays 


27 


G682 


4396287 


1.10E-05 


Glycine max 


27 


G682 


3857004 


0.00051 


Populus tremula x Populus tremuloides 


27 


G682 


9410205 


0.00085 


Trlticum aestivum 


27 


G682 


8382118 


0.0079 


Gossypium arboreum 


27 


G682 


2428139 


0.017 


Oryza sativa 


27 


G682 


7339148 


0.13 


Lycopersicon esculentum 


27 


G682 


9302672 


0.32 


Sorghum bicolor 


27 


G682 


5048991 


0.39 


Gossypium hirsutum 


27 


G682 


6555777 


0.46 


Pinus taeda 


29 


G911 


4090113 


6.10E-51 


Brassica napus 


29 


G911 


5893315 


7.70E-25 


Lycopersicon esculentum 


29 


G911 


5048452 


3.10E-23 


Gossypium hirsutum 


29 


G911 


9440241 


1 .90E-21 


Glycine max 


29 


G911 


6917169 


1.80E-11 


Lycopersicon pennellii 


29 


G911 


9297970 


3.20E-11 


Sorghum bicolor | 


29 


G911 


7137594 


4.90E-11 


Zea mays 


29 


G911 


9278447 


4.60E-10 


Lotus japonicus 


29 


G911 | 


7560271 


7.20E-10 


Medicago truncatula 


29 


G911 


5043346 


4.50E-09 


Sorghum halepense 


31 


G964 


7624806 


3.30E-72 


Gossypium arboreum 


31 


G964 


1234899 


9.10E-66 


Glycine max 


31 


G964 


1149534 


1 .50E-61 


Pimpinella brachycarpa 


31 


G964 


8919872 


3.40E-51 


Capsella rubella 


31 


G964 


992597 


6J0E-51 


Lycopersicon esculentum 


31 


G964 


1235564 


1.50E-38 


Oryza sativa ! 


31 


G964 


6605613 


3.00E-32 | 


Medicago truncatula 


31 


G964 


1032371 


4.50E-28 


Helianthus annuus 


31 


G964 


3868846 i 


2.80E-25 


Ceratopteris richardii 


31 


G964 


8088109 


6.40E-22 


Sorghum bicolor 


33 


G394 


8670502 


7.90E-59 


Glycine max 


33 


G394 


3171738 


2.00E-54 


Craterostigma plantagineum 


33 


G394 


1032371 


1.10E-50 


Helianthus annuus 


33 | 


G394 


7624806 


4.30E-47 | 


Gossypium arboreum 


33 


G394 


1160483 


2.10E-46 


Pimpinella brachycarpa 
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Figure 3D 



SEQID No. 


GID 


Genbank NID 


P-value 


Species 


33 


G394 


3868846 


4.20E-45 


Ceratopteris richardii 


! 33 


G394 


992597 


1.10E-44 


Lycopersicon esculentum 


I 33 


G394 


755851 1 


1 50E-44 


Medicago truncatula 


! 33 


G394 


8099247 


6.20E-43 


Oryza sativa | 


33 


G394 


8919872 


1.20E-40 


Capselia rubella 


35 


G489 


6534956 


4.40E-62 


Lycopersicon esculentum 


35 


G489 


9055852 


2.60E-60 


Medicago truncatula 


| 35 


G489 


8382393 


6.20E-51 


Gossypium arboreum 


35 


G489 


8789169 


2.10E-50 


Citrus x paradisi 


35 


G489 


9252957 


1.50E-47 


Solanum tuberosum 


35 


G489 


6918056 


4.70E-47 


Lycopersicon pennellii 


35 


G489 


7590809 


1 .00E-46 


Glycine max 


35 


G489 


5257255 


8.60E-43 


Oryza sativa 


35 


G489 


4152190 


3.20E-41 


Zea mays 


35 


G489 


6069260 


2.10E-39 


Ceratodon purpureus 


37 


G463 


6527230 


4.90E-36 


Lycopersicon esculentum 


37 


G463 


9305572 


5.50E-36 


Sorghum bicolor 


37 


G463 


3760881 

W 1 Www V-J I 


1.20E-31 


Oryza sativa 


37 


G463 


6604917 


1 .30E-23 


Medicago truncatula 


37 


G463 


5058123 


2.50E-21 


Glycine max 


37 


G463 


5044476 


1.10E-19 


Gossypium hirsutum 


i 37 


G463 


9412603 


1.70E-17 


Triticum aestivum 


37 


G463 


9419394 


6.00E-17 


Hordeum vulgare 


37 


G463 


7624108 


6.20E-17 


Gossypium arboreum 


37 


G463 


8547152 


3.20E-16 


Nicotiana tabacum 


39 


G767 


5510359 


2.80E-76 


Glycine max 


39 


G767 


7643155 


4.20E-74 


Medicago truncatula 


39 


G767 


6977319 


1.10E-72 


Lycopersicon esculentum I 


39 


G767 


6730939 


4.20E-68 


Oryza sativa 


39 


G767 


7502501 


2.00E-67 


Gossypium arboreum 


39 


G767 


9302206 


3.10E-65 


Sorghum bicolor 


39 


G767 


4218534 


4.30E-51 


Triticum sp. 


39 


G767 


6732157 


4.30E-51 


Triticum monococcum 


39 


G767 


9412602 


6.90E-47 


Triticum aestivum 


39 


G767 


8329134 


1.30E-46 


Mesembryanthemum crystallinum 


41 


G765 


4384535 


3.10E-56 


Lycopersicon esculentum ! 


41 


G765 


6454868 


8.50E-56 


Glycine max 


41 


G765 


1279639 


4.30E-53 


Petunia x hybrida 


41 


G765 


4977542 


2.00E-51 


Oryza sativa 


41 


G765 


4218536 


2.00E-50 


Triticum sp. 


41 


G765 


6732159 


2.00E-50 


Triticum monococcum 


41 


G765 


5049217 


6.90E-50 


Gossypium hirsutum 


41 


G765 


9361647 


4.50E-49 


Triticum aestivum 


41 


G765 


9296257 


2.90E-48 


Sorghum bicolor 


41 


G765 


8708584 


4.30E-46 


Hordeum vulgare 


I 43 


G197 


1167483 


2.70E-76 


Lycopersicon esculentum 


43 


G197 


7626566 


2.40E-73 


Gossypium arboreum 


i 43 


G197 


7765706 


1.50E-63 


Medicago truncatula 


! 43 


G197 


19052 


8.90E-63 


Hordeum vulgare 


! 43 


G197 


5050757 


1.60E-62 


Gossypium hirsutum 


i 43 


G197 


6850206 


1.10E-61 


Oryza sativa 


I 43 


G197 


6667606 


1.70E-61 


Glycine max 


! 43 


G197 


517492 


7.60E-59 


Zea mays 
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Figure 3E 



SEQ ID No. 


GID 


Genbank NID 


P-value 


Species 


43 


G197 


5860031 


3.90E-57 


Pinus taeda 


43 


G197 


9302672 


3.80E-55 


Sorghum bicolor 


45 


G255 


1167483 


6.40E-75 


Lycopersicon esculentum 


45 


G255 


7626566 


6.40E-71 


Gossypium arboreum 


45 


G255 


19050 


2.80E-65 


Hordeum vulgare 


45 


G255 


5050757 


3.70E-63 


Gossypium hirsutum 


45 


G255 


7590249 


4.10E-62 


Glycine max 


45 


G255 


7765706 


4.40E-62 


Medicago truncatula 


45 


G255 


6850206 


1.10E-61 


Oryza sativa 


45 


G255 


517492 


3.50E-59 


Zea mays 


45 


G255 


9302672 


1.60E-56 


Sorghum bicolor 


45 


G255 


7721017 


2.60E-55 


Lotus japonicus 


47 


G1113 


4090113 


2.30E-36 


Brassica napus 


47 


G1113 


5048452 


6.80E-12 


Gossypium hirsutum 


47 


G1113 


5893315 


9.50E-11 


Lycopersicon esculentum 


47 


G1113 


9440241 


7.70E-09 


Glycine max 


49 


G398 


7624806 


2.80E-67 


Gossypium arboreum 


49 


G398 


1234899 


6.90E-64 


Glycine max 


49 


G398 


1149534 


6.20E-63 


Pimpinella brachycarpa 


49 


G398 


8919872 


2.60E-47 


Capsella rubella 


49 


G398 


992597 


1.10E-39 


Lycopersicon esculentum 


49 


G398 


1235564 


7.70E-39 


Oryza sativa 


49 


G398 


6605613 


1 .70E-33 


Medicago truncatula 


49 


G398 


8088109 


3.60E-33 


Sorghum bicolor 


49 


G398 


3868846 


1 .60E-32 


Ceratopteris richardii 


49 


G398 


3171738 


1.00E-27 


Craterostigma plantagineum 


51 


G395 


992597 


5.30E-51 


Lycopersicon esculentum 


51 


G395 


7624806 


2.00E-50 


Gossypium arboreum 


51 


G395 


1234899 


1.50E-49 


Glycine max 


51 


G395 


1165131 


1.90E-48 


Pimpinella brachycarpa 


51 


G395 


3868846 


3.40E-47 


Ceratopteris richardii 


51 


G395 


7415619 


1.30E-41 


Physcomitrella patens 


51 


G395 


8919872 


7.40E-41 


Capsella rubella 


51 


G395 


1235564 


2.70E-38 


Oryza sativa 


51 


G395 


8088109 


2.30E-33 


Sorghum bicolor 


51 


G395 


1032371 


3.30E-31 


Helianthus annuus 


| 53 


G393 


L 8670502 


3.60E-55 


Glycine max 


53 


G393 


9199975 


7.60E-46 


Medicago truncatula 


53 


G393 


3868846 


9.60E-37 


Ceratopteris richardii 


53 


G393 


8919872 


2.50E-35 


Capsella rubella 


53 


G393 


7624806 


1.30E-34 


Gossypium arboreum 


53 


G393 


7415619 


1.00E-33 


Physcomitrella patens 


53 


G393 


5897000 


5.50E-33 


Lycopersicon esculentum 


| 53 


G393 


1235564 


4.00E-32 


Oryza sativa 


53 


G393 


1165131 


6.40E-32 


Pimpinella brachycarpa 


53 


G393 


3171738 


1.50E-31 


Craterostigma plantagineum 
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